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Abstract 



Given two rooted, ordered, and labeled trees P and T the tree inclusion problem is to determine 
if P can be obtained from T by deleting nodes in T. This problem has recently been recognized as 
an important query primitive in XML databases. Kilpelainen and Mannila [SIAM J. Comput. 1995] 
presented the first polynomial time algorithm using quadratic time and space. Since then several improved 
results have been obtained for special cases when P and T have a small number of leaves or small depth. 
However, in the worst case these algorithms still use quadratic time and space. Let ns, Is, and ds denote 
the number of nodes, the number of leaves, and the depth of a tree S £ {P,T}. In this paper we show 
that the tree inclusion problem can be solved in space 0{nT) and time: 



This improves or matches the best known time complexities while using only linear space instead of 
quadratic. This is particularly important in practical applications, such as XML databases, where the 
space is likely to be a bottleneck. 

1 Introduction 

Let T be a rooted tree. We say that T is labeled if each node is assigned a character from an alphabet E 
and we say that T is ordered if a left-to-right order among siblings in T is given. All trees in this paper are 
rooted, ordered, and labeled. A tree P is included in T, denoted P C T, if P can be obtained from T by 
deleting nodes of T. Deleting a node w in T means making the children of v children of the parent of v and 
then removing v. The children are inserted in the place of v in the left-to-right order among the siblings of 
V. The tree inclusion problem is to determine if P can be included in T and if so report all subtrees of T 
that include P. 

Recently, the problem has been recognized as an important query primitive for XML data and has 
received considerable attention, see e.g., [33,34,37,39-41]. The key idea is that an XML document can 
be viewed as a tree and queries on the document correspond to a tree inclusion problem. As an example 
consider Figure [1] Suppose that we want to maintain a catalog of books for a bookstore. A fragment of the 
tree, denoted I?, corresponding to the catalog is shown in (b). In addition to supporting full-text queries, 
such as find all documents containing the word " John" , we can also utilize the tree structure of the catalog 
to ask more specific queries, such as "find all books written by John with a chapter that has something to 
do with XML". We can model this query by constructing the tree, denoted Q, shown in (a) and solve the 
tree inclusion problem: is Q C Dl The answer is yes and a possible way to include Q in Z) is indicated 

*An extended abstract of this paper appeared in Proceedings of the 32nd International Colloquium on Automata, Languages 
and Programming, Lecture Notes in Computer Science, vol. 3580, pp. 66-77, Springer- Verlag, 2005. 

tTechnical University of Denmark, Department of Informatics and Mathematical Modelling. This work is part of the DSSCV 
project supported by the 1ST Programme of the European Union (IST-2001-35443). 

f Corresponding author: Technical University of Denmark, Department of Informatics and Mathematical Modelling, Build- 
ing 322, Office 124, DK-2800 Kongens Lyngby, Denmark. Phone: (+45) 45 25 36 73. Fax: (-1-45) 45 88 26 73. Email: 
ilgSimm . dtu . dk. 




1 




Q D catalog 

I 

^^^book^^^ ^book 

author chapter author chapter 

John XML name title section 

John databases XML 

(a) (b) 

catalog 

1 ' 

^^^book^^^^ ^ " "jL^book 

author, chapter _ . .author chapter 

John XML. . _ name title section 

~ ■ - - . _ _ . . -John databases " " "XML 

(c) 



queries 




chapter 
I 

title 
queries 



Figure 1: (a) The tree Q corresponding to the query, (b) A fragment of the tree D. Can the tree Q be 
included in the tree D7 It can and an embedding is given in (c). 



by the dashed lines in (c). If we delete all the nodes in D not touched by dashed lines the trees Q and 
D become isomorphic. Such a mapping of the nodes from Q to D given by the dashed lines is called an 
embedding (formally defined in Section [3]). We note that the ordering of the XML document, and hence the 
left-to-right order of siblings, is important in many cases. For instance, in the above example, the relative 
order of contents of the chapters is most likely important. Also, in biological databases, order is of critical 
importance. Consequently, standard XML query languages, such as XPath [13] and XQuery [10], require 
the output of queries to be ordered. 

The tree inclusion problem was initially introduced by Knuth [27, exercise 2.3.2-22] who gave a sufficient 
condition for testing inclusion. Motivated by applications in structured databases [24, 29] Kilpelainen and 
Mannila [25] presented the first polynomial time algorithm using 0{npnT) time and space, where np and 
riT is the number of nodes in P and T, respectively. During the last decade several improvements of the 
original algorithm of [25] have been suggested [1,11,23,32]. The previously best known bound is due to 
Chen [11] who presented an algorithm using 0{lpnT) time and 0{lp ■ min{rfT,^T}) space. Here, Is and ds 
denote the number of leaves and the depth of a tree S, respectively. This algorithm is based on an algorithm 
of Kilpelainen [23]. Note that the time and space is still QinpriT) for worst-case input trees. 

In this paper we present three algorithms which combined improve all of the previously known time and 
space bounds. To avoid trivial cases we always assume that 1 < np < hit- We show the following theorem: 

Theorem 1 For trees P and T the tree inclusion problem can be solved in 0{nT) space with the following 
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running time: 

IpTlT 

Iplr log log ut + ut 

Hence, when P has few leaves we obtain a fast algorithm and even faster if both P and T have few leaves. 
When both trees have many leaves and np = Vt(\o^nT), we instead improve the previous quadratic time 
bound by a logarithmic factor. Most importantly, the space used is linear. In the context of XML databases 
this will likely make it possible to query larger trees and speed up the query time since more of the compu- 
tation can be kept in main memory. 

The extended abstract of this paper [9] contained an error. The algorithms in the paper [9] did not use 
linear space. The problem was due to a recursive traversal of P which stored too many sets of nodes leading 
to a worst-case space complexity of ^{dplx)- In this paper we fix this problem by recursively visiting the 
nodes such that the child with the largest number of descendant leaves is visited first, and by showing that 
the size of the resulting stored node sots exponentially decrease. With these ideas we show that all of our 
algorithms use O(nT) space. Additionally, our improved analysis of the sizes of the stored node sets also 
leads to an improvement in the running time of the algorithm in the second case above. In the previous 
paper the running time was 0(nj,ZTloglognT + ut)- 



1.1 Techniques 

Most of the previous algorithms, including the best one [11], are essentially based on a simple dynamic 
programming approach from the original algorithm of [25]. The main idea behind this algorithm is the 
following: Let w be a node in P with children vi, . . . ,Vi and let w be a node in T . Consider the subtrees 
rooted at v and w, denoted by P{v) and T{w). To decide if P{v) can be included in we try to find a 

sequence wi, . . . , of left-to-right ordered descendants of w such that P{vk) E T{u)k) for all fc, 1 < fc < i. 
The sequence is computed greedily from left-to-right in T{w) effectively finding the leftmost inclusion of 
P{v) in T{w). Applying this approach in a bottom-up fashion we can determine, if P{v) C T{w), for all 
pairs of nodes u in P and w in T. 

In this paper we take a different approach. The main idea is to construct a data structure on T supporting 
a small number of procedures, called the set procedures, on subsets of nodes of T. We show that any such 
data structure implies an algorithm for the tree inclusion problem. We consider various implementations 
of this data structure which all use linear space. The first simple implementation gives an algorithm with 
0{lpnT) running time. As it turns out, the running time depends on a well-studied problem known as 
the tree color problem. We show a direct connection between a data structure for the tree color problem 
and the tree inclusion problem. Plugging in a data structure of Dietz [16] we obtain an algorithm with 
0{IpIt ^oglogUT + Ut) running time. 

Based on the simple algorithms above we show how to improve the worst-case running time of the set 
procedures by a logarithmic factor. The general idea used to achieve this is to divide T into small trees called 
clusters of logarithmic size which overlap with other clusters in at most 2 nodes. Each cluster is represented 
by a constant number of nodes in a macro tree. The nodes in the macro tree are then connected according to 
the overlap of the cluster they represent. Wc show how to efficiently preprocess the clusters and the macro 
tree such that the set procedures use constant time for each cluster. Hence, the worst-case quadratic running 
time is improved by a logarithmic factor. 

Our algorithms recursively traverse P top-down. For each node v G V{P) we compute a set of nodes 
representing all of the subtrees in T that include P{v). To avoid storing too many of these node sets the 
traversal of P visits the child with the largest number of descendant leaves first. For the first two algorithms 
this immediately implies a space complexity of 0{lT^oglp), however, by carefully analyzing the sizes of 
stored node sets we are able to show that they decrease exponentially leading to the linear space bound. In 
the last algorithm the node sets are compactly encoded in O {ut/ log ut) space and therefore our recursive 
traversal alone implies a space bound of 0{nT/ log ut ■ logZp) = 0{nT). 
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Throughout the paper we assume a unit-cost RAM model of computation with word size Q{\ognT) and a 
standard instruction set including bitwise boolean operations, shifts, addition, and multiplication. All space 
complexities refer to the number of words used by the algorithm. 

1.2 Related Work 

For some applications considering unordered trees is more natural. However, in [25, 30] this problem was 
proved to be NP-complete. The tree inclusion problem is closely related to the tree pattern matching problem 
[14, 17,22,28]. The goal is here to find an injective mapping / from the nodes of P to the nodes of T 
such that for every node w in P the ith child of v is mapped to the zth child of f{v). The tree pattern 
matching problem can be solved in (rip + tit) log'^'-"'^-' {np + tit) time. Another similar problem is the subtree 
isomorphism problem [12,35], which is to determine if T has a subgraph isomorphic to P. The subtree 
isomorphism problem can be solved efficiently for ordered and unordered trees. The best algorithms for this 
problem use 0( "^^'J +nT) time for unordered trees and Q( i"g^p +"-t) time for ordered trees [12,35]. Both 
use 0{npnT) space. The tree inclusion problem can be considered a special case of the tree edit distance 
problem [15,26,36,42]. Here one wants to find the minimum sequence of insert, delete, and relabel operations 
needed to transform P into T. Currently the best algorithm for this problem uses 0(nT«p(l + log^)) 
time [15]. For more details and references see the survey [8]. 

1.3 Outline 

In Section [2] we give notation and definitions used throughout the paper. In Section [3] a common framework 
for our tree inclusion algorithms is given. Section |3] presents two simple algorithms and then, based on these 
results, we show how to get a faster algorithm in Section [5] 

2 Notation and Definitions 

In this section we define the notation and definitions we will use throughout the paper. For a graph G we 
denote the set of nodes and edges by V{G) and E{G), respectively. Let T be a rooted tree. The root of T 
is denoted by root(r). The size of T, denoted by ut, is |y(T)|. The depth of a node v G V{T), depth(w), is 
the number of edges on the path from v to root(T) and the depth of T, denoted dr, is the maximum depth 
of any node in T. The parent of v is denoted parent (u) and the set of children of v is denoted child(ti). We 
define parent (root (T)) — _L, where _L ^ V{T) is a special null node. A node with no children is a leaf and 
otherwise an internal node. The set of leaves of T is denoted L{T) and we define It = 1^(2^) |- We say that 
T is labeled if each node w is a assigned a character, denoted label(w), from an alphabet S and we say that 
T is ordered if a left-to-right order among siblings in T is given. Note that we do not require that the size 
of the alphabet is bounded by a constant. All trees in this paper are rooted, ordered, and labeled. 

Ancestors and Descendants Let T(w) denote the subtree of T rooted at a node w £ V{T). IfwG V{T{v)) 
then V is an ancestor of w, denoted v ^ w, and if w € V{T{v))\{v} then w is a proper ancestor of w, denoted 
V < w. If u is a (proper) ancestor of w then w is a (proper) descendant of w. A node z is a common ancestor 
of V and w if it is an ancestor of both v and w. The nearest common ancestor of v and w, nca(i', w), is the 
common ancestor of v and w of greatest depth. The first ancestor of w labeled a, denoted fl(w,a), is the 
node V such that v < label(u) = a, and no node on the path between v and w is labeled a. If no such 
node exists then fl(w, a) — _L. 

Traversals and Orderings Let T be a tree with root v and let wi , . . . , be the children of v from left- 
to-right. The preorder traversal of T is obtained by visiting v and then recursively visiting T{vi), 1 < i < fc, 
in order. Similarly, the postorder traversal is obtained by first visiting T{vi), 1 < « < fc, in order and then 
V. The preorder number and postorder number of a node w G T(v), denoted by pre(u') and post('u;), is the 
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(a) (b) 
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Figure 2: In (a) we have {{vi,V2,V3,ve,V7),{vi,V2,V5,ve,V7),{vi,V4,V5,V(i,vr),{v3,V4,V5,ve,vr)} = 
^{81,82, Si, S3, Si) and thus mop(5'i, 5*2, S*!, 5*3, S'4) = {{v3,V7)}. In (b) we have <I>(S'i, 6*2, S*!, S'3, 5*4) = 

{{vi,V2,V3,V5,V7), {vi,V2,Ve,V8,Vg), (wi , ■i;2 , V3, ^^S, Vq), {vi, V2, V3, V5, Vg) , {vi, V4, Vq, Vs, Vg) , {v3,V4,Ve,Vs,Vg)} 

and thus mop(S'i, S'2, 81,83, 84) = {(ui, W7), {v3,vg)}. 



number of nodes preceding w in the preorder and postorder traversal of T, respectively. The nodes to the 
left of w in T is the set of nodes u S V{T) such that pre(it) < pre(w) and post(u) < post(w). If u is to the 
left of w, denoted hy u <\ w, then w is to the right of m. If u <\ w ot u ^ w ot w ^ u we write u ^ w. The 
null node _L is not in the ordering, i.e., J- ^ v for all nodes v. 

Minimum Ordered Pairs A set of nodes X C V{T) is deep if no node in X is a proper ancestor of 
another node in X. For k deep sets of nodes Xi, . . . , Xk let ^{Xi, . . . , Xk) C [Xi x • • • x Xk), be the set of 
tuples such that (xi, . . . ,Xk) € $(^1, ■ . ■ ,Xk) iff a;i < • • • < Xk- If [xi, . . . ,Xk) G ^{Xi, . . . ,Xk) and there is 
no {x[, . . . ,x'i.) € $(^1, • . ■ , Xk), where either xi <ix[ <ix'f.^Xk or xi ^x'l < x'^. < Xk then the pair (xi, Xk) is a 
minimum ordered pair. Intuitively, {xi, Xk) is a closest pair of nodes from Xi and Xk in the left-to-right order 
for which we can find X2, ■ ■ ■ , Xk~i such that xi <\ - ■ ■ <ixk. The set of minimum ordered pairs for Xi, . . . , Xk 
is denoted by mop(Xi, . . . , Xk). Figure [2] illustrates these concepts on a small example. For any set of pairs 
Y, let Y\^ and ¥{2 denote the projection of Y to the first and second coordinate, that is, if {yi, j/2) G y then 
Ui G and 1/2 G Yl^- We say that Y is deep if Y\-^ and Fjj are deep. The following lemma shows that 
given deep sets Xi, . . . ,Xk we can compute mop(Xi, . . . ,Xk) iteratively by first computing mop(Xi, X2) 
and then mop(mop(Xi, X2)|2, ^3) and so on. 

Lemma 1 For any deep sets of nodes Xi, . . . , Xk, k > 2, we have, {xi, Xk) € mop(Xi, . . . , Xk) iff there exists 
a node Xk-i such that {xi,Xk-i) € mop(Xi, . . . ,Xk-i) and {xk-i,Xk) € mop(mop(Xi, . . . , Xk-i)\2, Xk). 

Proof. We start by showing that if {xi,Xk) G mop(Xi, . . . ,Xk) then there exists a node Xk-i such that 
{xi,Xk-i) e mop(Xi, . . . ,Xk-i) and {xk-i,Xk) S mop(mop(Xi, . . . , Xk-i)\2, Xk). 

First note that {zi,...,Zk) E ^{Xi,...,Xk) implies {zi, . . . , Zk-i) € ^{Xi, . . . , Xk^i). Since {xi,Xk) e 
mop(Xi, . . . , Xk) there must be a minimum node Xk-i such that the tuple (xi, . . . , Xk~i) is in $(^^^1, . . . , Xk-i) 
We have {xi,Xk-i) G mop(Xi, . . . , Xk~i). We need to show that {xk-i,Xk) G mop(mop(Xi, . . . , Xk~i)\2, Xk). 
Since {xi,Xk) G mop(Xi, . . . , Xk) there exists no node z G Xk such that Xk-i <i z <i Xk. If such a z existed 
we would have {xi, . . . ,Xk~i,z) G ^{Xi, . . . ,Xk), contradicting that {xi,Xk) G mop(Xi, . . . ,Xk). Assume 
there exists a node z G mop(A"i, . . . , Xk^i)\2 such that Xk-i <i z <\Xk. Since {xi, Xk-i) G mop(Ari, . . . , Xk-i) 
this implies that there is a node z' l> xi such that {z',z) G mop(Ari, . . . ,Xk-i). But this implies that the 
tuple (z', . . ■ , z, Xk) is in ^{Xi, . . . , Xk) contradicting that {xi,Xk) G mop(Xi, . . . , Xk). 

We will now show that if there exists a node Xk-i such that {xi,Xk^i) G mop{Xi, . . . , Xk-i) and 
{xk-i,Xk) G mop(mop(Ari, . . . , Xk-i)\2, Xk) then the pair {xi,Xk) G mop(Xi, . . . ,Xk). Clearly, there exists 
a tuple {xi, . . . ,Xk^i,Xk) G ^{Xi, . . . ,Xk). Assume that there exists a tuple {zi, . . . ,Zk) G ^{Xi, . . . ,Xk) 
such that xi <i zi <i Zk ^ Xk. Among the tuples satisfying these constraints let (j/i, . . . , yk~i, yk) be the one 
with maximum yi, minimum yk-i, and maximum yk. It follows that {yi,yk-i) G mop(Xi, . . . , 
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Since (xi,Xk-i) G niop(Xi, . . . , we must have Xk-i <\ yk-i- But this contradicts {xk-i,Xk) G 

mop(mop(Xi, . . . ,Xk-i)\^,Xk), since node yk-i S mop(Xi, . . .,Xk-i)\2- 

Assume that there exists a tuple (zi,...,Zfe) e such that xi < zi <\ Zk < Xk- Since 

(xijXfe-i) e mop(Xi. . . . we have x^-i < z^-i and thus a;fe_i <i Zk <i Xk contradicting {xk-i,Xk) G 

mop(mop(Xi, . . . ,Xi;_i)|2,Xfe). □ 

The following lemma is the reverse of the previous lemma and shows that given deep sets Xi , . . . , X^ we also 
can compute mop(Xi, . . . , Xk) iteratively from right to left. The proof is similar to the proof of Lemma[TJ 

Lemma 2 For any deep sets of nodes Xi, . . . ,Xk, k > 2, we have, {xi,Xk) G mop(Xi, . . . , Xk) iff there 
exists a node Xi such that (x2, Xk) G mop(X2, . . . , Xk) and (xi, X2) G mop(Xi. mop(X2, . . . , 

Heavy Leaf Path Decomposition Wc construct a heavy leaf path decomposition of P as follows. We 
classify each node as heavy or light. The root is light. For each internal node v we pick a child Vj of v with 
maximum Ip^^.) and classify it as heavy. The remaining children of v are light. An edge to a light node is 
a light edge, and an edge to a heavy node is a heavy edge. The heavy child of a node v is denoted heavy (u). 
Let Idepth(u) denote the number of light edges on the path from v to root(P). 

Note that a heavy leaf path decomposition is the same as the classical heavy path decomposition [21] 
except that the heavy child is defined as the child with largest number of the descendant leaves and not 
the child with the largest number of descendants. This distinction is essential for achieving the linear space 
bound of our algorithms. Note that heavy path decompositions have previously been used in algorithms for 
the related tree edit distance problem [26]. 

Lemma 3 For any tree P and node v G V{P), 

1 < 

'p(ti) :i 2idopth(i)) ■ 

Proof. By induction on Idepth(u). For Idepth(w) = it is trivially true. Let Idepth(u) = £. Assume wlog. 
that V is light. Let w be the unique light ancestor of v with ldepth(i(;) — £—1. By the induction hypothesis 
lp(w) ^ ^p/2^~^. Now V has a sibling heavy(parent(i;)) and thus at most half of the leaves in P(parent(ti)) 
can be in the subtree rooted at v. Therefore, lp(y) < lp[yj)/2 < lp/2^. □ 

Corollary 1 For any tree P and node v G V{P), Idepth(w) < \oglp. 

Notation When we want to specify which tree we mean in the above relations we add a subscript. For 
instance, v <t w indicates that v is an ancestor of w in T. 

3 Computing Deep Embeddings 

In this section we present a general framework for answering tree inclusion queries. As in [25] we solve the 
equivalent tree embedding problem. Let P and T be rooted labeled trees. An embedding of P in T is an 
injective function / : V{P) V{T) such that for all nodes v,u G V{P), 

(i) label(u) = label(/(ti)). (label preservation condition) 

(ii) V ^ u iS f{v) -< f{u). (ancestor condition) 

(iii) V <u \E f{v) <\ f{u). (order condition) 

An example of an embedding is given in Figure [IJc). 

Lemma 4 (Kilpelainen and Mannila [25]) For any trees P and T, P \—T iff there exists an embedding 
of P in T . 
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We say that the embedding / is deep if there is no embedding g such that /(root(P)) -< g(root(P)). The 
deep occurrences of P in T, denoted cmb(P, T) is the set of nodes, 



emb(P, T) ~ {/(root(P)) | / is a deep embedding of P in T}. 

By definition the set of ancestors of nodes in emb(P, T) is exactly the set of nodes {u | P E T{u)}. Hence, 
to solve the tree inclusion problem it is sufhcient to compute emb(P, T) and then, using additional 0(71t) 
time, report all ancestors of this set. We note that Kilpelainen and Mannila [25] used the similar concept 
of left emheddings in their algorithms. A left embedding of P in T is an embedding such that the root of 
P is mapped to the node in T with the smallest postorder number, i.e., the deepest node among the nodes 
furthest to the left. Our definition of emb(P, T) only requires that the root is mapped to a deepest node. 

In the following we show how to compute deep embeddings. The key idea is to build a data structure for 
T allowing a fast implementation of the following procedures. For all X C V{T), Y C V{T) x V{T), and 
a € E define: 

Parent(X): Return the set {parent(a;) | x G X}. 

NCA(r): Return the set {nca(j/i, 2/2) | (2/1, y2) e Y}. 

Deep(X): Return the set {a: € X | there is no 2 e X such that x -< z}. 

MopRiGHT(y, X): Return the set of pairs R such that for any pair (2/1,2/2) € Y, {yi,x) € i? iff (2/2,2^) S 
niop(r|2,X). 

MopLeft(X, y): Return the set of pairs R such that for any pair (2/1,2/2) G Y, (^,2/2) G P iff (2;, 2/1) G 

mop(x,ry. 

Fl(X, a): Return the set DEEP({fl(x, a) | x e X}). 

Collectively we call these procedures the set procedures. The procedures Parent and NCA are self-explanatory. 
Deep(X) returns the set of all nodes in X that have no descendants in X. Hence, the returned set is always 
deep. MopRiGHT and MopLeft are used to iteratively compute minimum ordered pairs. FL{X,a) returns 
the deep set of first ancestors with label a of all nodes in X. If we want to specify that a procedure applies 
to a certain tree T we add the subscript T. With the set procedures we can compute deep embeddings. 
The following procedure Emb(w), v € V{P), recursively computes the set of deep occurrences of P{v) in T. 
Figure [3] illustrates how Emb works on a small example. 

Emb(w): Let vi, . . . ,Vk be the sequence of children of v ordered from left to right. There are three cases: 

1. fc = (w is a leaf). Compute R := FL(P(r), label(w)). 

2. k ~ 1. Recursively compute Pi := Emb(wi). 
Compute R := Fl(Deep(Parent(Pi)), label(i;)). 

3. fc > 1. Let Vj be the heavy child of v. 

Recursively compute Rj := EMB(uj) and set Uj :— {{r,r) \ r € Rj}- 

For i := j + 1 to k, recursively compute P,; := EMB(wi) and Ui := MopRiGHT(t/i_i, P^). 

Set Uj := C/fc. 

For i := j — 1 to 1, recursively compute P,; :— EMB(wi) and Ui :— MopLEFT(Pi, Ui+i). 
Finally, compute P :== Fl(Deep(Nca(C/i)), label(w)). 

If P = stop and report that there is no deep embedding of P{v) in T. Otherwise return P. 

To prove the correctness of the Emb procedure we need the following two propositions. The first propo- 
sition characterizes for node v g V{P) the set emb(w) using mop, nca, and fl. The second proposition shows 
that the set Ui computed in step 3 of the Emb procedure is the set mop(EMB(wi), . . . , EMB(wfe)). 
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Figure 3: Computing the deep occurrences of P into T depicted in (a) and (b) respectively. The nodes in P 
are numbered 1-4 for easy reference, (c) Case 1 of Emb: The crossed nodes are the nodes in the set Emb(3). 
Since 3 and 4 are leaves and labcl(3) = label(4) we have Emb(3) = Emb(4). (d) Case 2 of Emb: The black 
nodes are the nodes in the set Emb (2). Note that the middle child of the root of T is not in the set since it 
is not a deep occurrence, (e) and (f) illustrates the computation of Emb(1) and case 3 of Emb: (e) The two 
minimal ordered pairs of the sets from (d) and (c). In the procedure i?i is the set from (d) and i?2 is the 
set from (c). The set Ui = {{v,v) | v € Ri} and the set U2 = MopRight({/i, i?2) which corresponds to the 
pairs shown in (e). The black nodes in the pairs are the nodes from Ri and the crossed nodes are the nodes 
from i?2 • Since fc = 2 we set C/i = C/2 • (f ) The nearest common ancestors of both pairs shown in (e) is the 
root node of T which is the only (deep) occurrence of P. 
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(«) {b) 



Figure 4: (a) For all i, Wi and Ui are roots of occurrences of P{vi) in T, and w and u is the nearest common 
ancestor of {wi,w^) and (mijMs), respectively. Since wi < wi and M3 < W3 we cannot have u ^ w. (b) For 
all i, is an embedding of P{vi) in T, (wi^w^) is a minimum ordered pair, and w is the nearest common 
ancestor of all the Wi's. The number of leaves in T{w) is at least ^T(wi) ^ ^P(vi)- 

Proposition 1 Let v be a node in P and let vi,...,Vk be the sequence of children of v ordered from 
left to right, where k > 2. For any node w € emb(P(w),r), there exists a pair of nodes {wi,Wk) G 
mop(emb(P(wi), T), . . . , emb(P(i'fc), T)) such that w = fl(nca(?«i, Wfc), label(w)). 

Proof. Since w is the root of an occurrence of P{v) in T there must exist a set of disjoint occurrences of 
P{vi), . . . , P{vk) in T(w) with roots wi <1 . . . < tUfc, such that w is an ancestor oi Wi, . . . ,Wk- Since the Wi's 
are ordered w must be an ancestor of iica{wi,Wk)- Since w is the root of a deep occurrence of P{v) in T it 
follows that w = fl(nca(w;i, Wfc),label(w)). 

It remains to show that we can assume {wi,Wk) S mop(emb(P(wi), T), . . . , emb(P(ufc), T)). It follows 
from the previous discussion that {wi, . . . , Wk) & $(emb(F(wi), T), . . . , emb{P{vk), T)). Assume for the sake 
of contradiction that (wi, Wk) is not a minimum ordered pair. Then there exists a set of disjoint occurrences 
of P{vi), . . . , P{vk) in T{'w) with roots ui <J . . . < Uk, such that either wi < ui <1 Wfc < Wk or wi <ui <iuk <iwk, 
and (wijMfc) G mop(emb(P(ui), T), . . . ,emh{P{vk),T)). Therefore u = fl(nca(ui, Ufc), label(w)) is an embed- 
ding of P{v) in T. Now either w ^ u contradicting the assumption that w is a deep embedding or w = u in 
which case (ui, Uk) satisfies the properties of the lemma (see also FigureHJa)). □ 



Proposition 2 For j + 1 < I < k, 

Ui = mop(EMB(i;j), . . . , Emb{vi)), (1) 

Forl<l<j-l, 

[/; = mOp(EMB(u;), . . . ,EMB(wfc)). (2) 

Proof. We first show Equation ^ by induction on For / — j + 1 it follows from the definition of MopRight 
that Ui is the set of minimum ordered pairs of EMB(wj) and EMB{vj+i), i.e., Ui = mop(EMB(-yj), FjMb{vi)). 
Hence, assume that I > j + I. By the induction hypothesis we have 

Ul = MOPRIGHT([/;_i, EMB(wi)) = MOPRlGHT(mOp(EMB(Wj ) , . . . , Emb(w;_i)), i?;) . 

By definition of MopRight, Ui is the set of pairs such that for any pair (r^, r;_i) G mop(EMB(wj), . . . , Emb(w;_ 
{rj,ri) G Ul iff (r;_i,r;) G mop(mop(EMB(-(;j), . . . ,Emb{vi^i))\^, Ri). By Lemma[T]it follows that {rj,ri) G 
Ul iff irj,ri) G mop(EMB(wj), . . . ,Emb(w;)). 
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We can now similarly show Equation ^ by induction on j' = j — I- By Equation ([T]) we have Uj — 
niop(EMB(i'j), . . . ,EMB(t;fe)) when we begin computing Uj-i. For j' = 1 (/ = j — 1) it follows from the 
definition of MopLeft that Uj-i ~ mop(EMB(i;j_i), EMB(t;j)). Hence, assume that j' > 1. Using LemmaH] 
the Equation follows similarly to the proof of Equation ([T|) . □ 

By Proposition [21 Ui = mop(EMB(wi), . . . , EMB(wfe)). We can now show the correctness of procedure Emb. 

Lemma 5 For trees P and T and node v e V(P), Emb(?;) computes the set of deep occurrences of P{v) in 
T. 

Proof. By induction on the size of the subtree P{v). If ti is a leaf, emb{v, T) is the deep set of nodes in T 
with label labe^w). It immediately follows that emb(ti,T) = FL(L(r), label(w)) and thus case 1 follows. 

Suppose that v is an internal node with fc > 1 children vi, . . . ,Vk- We show that emb(P('y), T) = EMB(t;). 
Consider cases 2 and 3 of the algorithm. 

For fc = 1 we have that w S Emb(w) implies that label(ii;) — label(u) and there is a node wi € Emb(wi) 
such that fl(parent(wi), label(?;)) = w, that is, no node on the path between wi and w is labeled label(u). 
By induction Emb(?;i) = emb(P(ui),T) and therefore w is the root of an embedding of P{v) in T. Since 
Emb(w) is the deep set of all such nodes it follows that w € emb(P(u), T). Conversely, if w G emb(P(u), T) 
then label(z«) = label(?;), there is a node wi £ emb(P(wi),T) such that w -< wi, and no node on the path 
between w and wi is labeled label(w), that is, fl(wi, label(z;)) = w. Hence, w G Emb(u). 

Next consider the case fc > 1. By Proposition [2] and the induction hypothesis 

Ui = mop(emb(P(wi),r), . . . , emb(P(t;fc), T)) . 

We first show that w G emh{P{v),T) implies that w £ Emb(u). By Proposition [T] there exists a pair of 
nodes {wi,Wk) £ mop(emb(P(?;i), T), . . . , emb(P(ufc), T)) such that w — fl(nca(it;i, Wfc), label(t;)). We have 
{wi,Wk) £ Ui and it follows directly from the implementation that w £ Emb(u). To see that we do not loose 
w by taking Deep of NCA(t/i) assume that w' — nca.{wi,Wk) is removed from the set in this step. This 
means there is a node u in Nca(C/i) which is a descendant of w' and which is still in the set. Since w is the 
root of a deep occurrence we must have w — fl(w', label(w)) — fl(u, label(w)). 

Let w £ Emb(w). Then w is the first ancestor with label label(w) of a nearest common ancestor of a pair in 
Ui. That is, label(u;) = label(i;) and there exists nodes {wi,Wk) £ mop(emb(P(wi), T), . . . , emb(P(t;fc), T)) 
such that w — fl(nca(wi, Wfc), label(?;)). Clearly, w is the root of an embedding of P{v) in T. Assume for 
contradiction that w is not a deep embedding, that is, w < u for some node u £ emb(P(u), T). We have just 
shown that this implies u £ Emb(w). Since Emb(w) is a deep set this contradicts w £ Emb(w). □ 

The set L{T) is deep and in all three cases of EMB(t/) the returned set is also deep. By induction it follows 
that the input to Parent, Fl, Nca, and MopRight is always deep. We will use this fact to our advantage 
in the following algorithms. 

4 A Simple Tree Inclusion Algorithm 

In this section we a present a simple implementation of the set procedures which leads to an efficient tree 
inclusion algorithm. Subsequently, we modify one of the procedures to obtain a family of tree inclusion 
algorithms where the complexities depend on the solution to a well-studied problem known as the tree color 
problem. 

4.1 Preprocessing 

To compute deep embeddings we require a data structure for T which allows us, for any v,w £ V{T), to 
compute nceLTiv, w) and determine ii v ^ w or v <lw. In linear time we can compute pre(w) and post(w) for 
all nodes v £ V{T), and with these it is straightforward to test the two conditions. Furthermore, 
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Lemma 6 (Harel and Tarjan [21]) For any tree T there is a data structure using O^ut) space and pre- 
processing time which supports nearest common ancestor queries in 0{1) time. 

Hence, our data structure uses linear preprocessing time and space (see also [2, 7] for more recent nearest 
common ancestor data structures). 

4.2 Implementation of the Set Procedures 

To answer tree inclusion queries we give an efficient implementation of the set procedures. The idea is to 
represent sets of nodes and sets of pairs of nodes in a left-to-right order using linked lists. For this purpose 
we introduce some helpful notation. Let X = [xi, . . . , Xk] be a linked list of nodes. The length of X , denoted 
\X\, is the number of elements in X and the list with no elements is written []. The ith node of X, denoted 
X[i], is Xi- Given any node y the list obtained by appending y to X, is the list X oy = [xi^ . . . ,Xk,y]- If 
for alH, 1 < i < \X\ — 1, X[i] < X[i + 1] then X is ordered and if X[i] < X[i + 1] then X is semiordered. 
Recall that X[i] <X[i + 1] means that we can have X[i] <l X[i + 1] or cither of the nodes can be an ancestor 
of the other {X[i] < X[i + 1] or X[i] ^ X[i + 1] or X[i] >■ X[i + 1]). A list Y = [{xi, Zk), {xk,Zk)] is a 
node pair list. By analogy, we define length, append, etc. for Y. For a pair Y[i] — {xi,Zi) define Y[i]i — Xi 
and Y[i]2 — Zi. If the lists [i^[l]i, . . . , and [1^[1]2, ■ • ■ , ^[fc]2] are both ordered or semiordered then Y 

is ordered or semiordered, respectively. 

The set procedures are implemented using node lists. All lists used in the procedures are either ordered 
or semiordered. As noted in Section [3] we may assume that the input to all of the procedures, except Deep, 
represent a deep set, that is, the corresponding node list or node pair list is ordered. We assume that the 
input hst given to Deep is semiordered and the output, of course, is ordered. Hence, the output of all the 
other set procedures must be semiordered. In the following let X be a node list, Y a node pair list, and a a 
character in E. The detailed implementation of the set procedures is given below. We show the correctness 
in Section 14.31 and discuss the complexity in Section 14.41 

Parent(X): Return the list [parent(X[l]), . . . , parent(X[|X|])]. 

NCA(y): Return the list [nca(y [1]), . . . , nca(y [|r|])]. 

Deep(X): Initially, set x := X[l\ and R := []. 

For i := 2 to |X| do: 

Compare x and X[i]. There are three cases: 

1. X <l X[i]. Set R:^ Ro X and x := X[i]. 

2. x< X{i\. Set X := X[i]. 

3. X[i] :< X. Do nothing. 

Return Ro x. 

The implementation of procedure Deep takes advantage of the fact that the input list is semiordered. In 
case 1 the node X[i] is to the right of our "potential output node" x. Since any node that is a descendant 
of X must also be to the left of X[i\ it cannot appear later in the list X than X[i]. We can thus safely add 
x to i? at this point. In case 2 the node x is an ancestor of X\i] and thus x cannot be in Deep(X). In case 
3 the node X[i] is an ancestor of x and can therefore not be in Deep(X). 

MopRiGHT(y, a:): Initially, set R := []. 

Find the smallest j such that Y[l]2 <\ and set y := y[l]i, x := X[j]. If no such j 
exists stop. 

For i := 2 to \Y\ do: 



11 




Until Y[i]2 <J X[j] or j > \X\ set j := j + 1. 

If j > \X\ stop and return R:= Ro {y,x). Otherwise, compare X[j] and x. There 
are two cases: 

1. If a; <1 X[j] set R:^ Ro {y,x), y := Y[i]i, and x := X[j]. 

2. If a; = X[j] set y := Y\i]i. 

Return R :— Ro (y, x). 

In procedure MopRight we have a "potential pair" {y,x) where y — Y[i']i for some i' and i^[j']2 <1 In 
case 1 we have x <J X[j] and also l^[j']2 <iY[i]2 since the input lists are ordered and i' < i (see Figure [Sja)). 
Therefore, (y, x) is inserted into R. In case 2 we have a; — X[j], i.e., y[j]2 <la;, and as before i^[«']2 <1^[*]2 (see 
FigurelUb)). Therefore (y,x) cannot be in MopRiGHT(y, X), and we set {Y[i]i,x) to be the new potential 
pair. 

We can implement MopLeft(X, F) similarly to MopRight replacing smallest by largest, <J by >, and 
traversing the lists backwards: 

MopLEFT(X,y): Initially, set R := []. 

Find the largest j such that l> X[j] and set y := >^[|i^|]2 and x := X[j]. If no 

such j exists stop. 

For i := \Y\ - 1 to 1 do: 

Until Y[i]i t> X[j] or j < 1 set j := j - 1. 

If J < 1 stop and return R := Ro {y,x). Otherwise, compare X[j] and x. There 
are two cases: 

1. li x> X[i] set R := (x.y) o R, y := Y[i]2, and x := X[j]. 

2. If a; = X[j] set y Y\i]2. 

Return R :^ Ro (x, y). 

Fl{X, a): Initially, set L X , Z := L. 

Repeat until Z — W: 

For i := 1 to \Z\ do: 

1. If label(Z[z]) = a: Delete Z[i] from Z (but keep it in L). 

2. If label(2'[i]) 7^ a and parent(Z[i]) ^ _L: Replace Z[i] with parent(Z[i]) in both 
Z and L. 

3. If label(Z[i]) 7^ a and parent(Z[i]) = _L: Delete Z[i] from both Z and L. 
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Set Z := Deep*(Z,L). 



Return L. 

The procedure Fl computes the set DEEP({fl(a;, a)|a:: £ X\) bottom up. The hst Z contains ancestors of 
the elements of X for which we have not yet found an ancestor with label a. In each step it considers each 
node z in the list Z. If it has the right label then x S Fl(X, a) and we remove it from Z but keep it in L. 
Otherwise we replace it with its parent (unless it is the root). Thus L contains both the elements in Z and 
the part of Fl(X, a) found until now. 

To keep the running time down we wish to maintain the invariant that L is deep at the beginning of each 
iteration of the outer loop. To do this procedure Fl calls an auxiliary procedure Deep*(Z, L) which takes 
two ordered lists Z and L, where Z <Z L, and returns the ordered list representing the set Deep(L) n Z , 
i.e., Deep*(Z, L) = [z e Z\'^x ^ L : z ^ x]. If we use the procedure Deep to calculate Deep* it takes 
time 0{\Z\ + \L\) = 0(|L|). Instead we will show how to calculate it in time 0(|^|) using a linked list 
representation for Z and L. We will need this in the proof of Lemma fT2l which shows that the total running 
time of all calls to Fl from Emb takes time 0{nT)- Below we describe in more detail how to implement Fl 
together with the auxiliary procedures. 

We use a doubly linked list to represent L and extra pointers in this list to represent Z. Each element 
in the list has pointers Succl and Pred^ pointing to its predecessor and successor in L. Similarly, each 
element in Z has pointers Succ^ and Pred^ pointing to its predecessor and successor in Z (right after 
the initialization these are equal to Succ^ and Pred^). In the for loop we use the Succ^- pointers to find 
the next element in Z. To delete Z[i\ from Z in case 1 we set Succz(Predz(.Z[i])) = S\izCz{Z[i\) and 
Predz{S\icCz{Z[i])) — PreAz{Z[i\). The L pointers stay the same. In case 2 we simply replace Z[i\ with its 
parent in the linked list. The Succ and Pred pointers stay the same. To delete Z[i] from both Z and L in 
case 3 we set SucCj(Predj(Z[i])) = S\izc-j{Z[i\) and Predj(SucCj(Z[i])) = Pred-,(Z[i]) for j e {Z,L}. Finally, 
to compute Deep*(Z', L) walk through Z following the Succz pointers. At each node z compare PredL(z) 
and Succl(z) with z. If one of them is a descendant of z remove z from the doubly linked list Z and L as in 
case 3. Note that instead of calling Deep*(Z, L) this comparison can also be done directly in step 2, which 
is the only place where we insert nodes that might be an ancestor of another node in L. We will show in the 
next section that it is enough to compare z to its neighbors in the list L. 

4.3 Correctness of the Set Procedures 

Clearly, Parent and NCA are correct. The following lemmas show that Deep, Fl, and MopRight are 
also correctly implemented. For notational convenience we write x ^ X, for a list X,iix — X[{\ for some i, 
l<i< \X\. 

Lemma 7 Procedure Deep(X) is correct. 

Proof. Let x be the variable in the procedure. We will first prove the following invariant on x: Let i be the 
current index. Right before increasing i in line 2 of the code ("For i := 2 to \X\ do:"), we have x -/< X\j\ for 
any 1 < j < i. 

Induction on i. The base case i = 1 is obviously correct. Let i > 2. Let x' denote the value of the x variable 
before the last iteration. \i x = x' then by the induction hypothesis x 7^ X\2\ for any 1 < j < i — 1. Since 
x was not changed we have ^ a; (case 3 of the procedure) and thus x X\j\ for any 1 < j < i. \i x ^ x' 
then a; = and by the induction hypothesis x' X[j] for any 1 < j < i — 1. Since x' ^ x either x' <\ x 
(case 1 of the procedure) or x' < x (case 2 of the procedure) holds. For a;' ^ a; it follows immediately from 
the induction hypothesis that x for any 1 < j < since all descendants of x also are descendants of 

x' . For a:' O a; we note that is the first node x" occurring after x' in X such that x' <l x" (otherwise 
X would have been reset in case 1 in the iteration where = a;"). Since X is semiordered no node X\j\ 
with smaller index than i can be to the right of x' . Thus no node 1 < j < can be to the right of x' . 

Since all descendants of x must be to the right of x' we have x 7^ X\2\ for any 1 < J < 
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Wo arc now ready to prove that y G Deep(X) iff there exists no z £ X such that y < z. We first argue 
that if 2/ G Deep(X) then '^z € X such that y ~< z. Let y be an element in Deep(X). Only elements 
that have been assigned to x during the procedure are in the output. Consider the iteration where x = y is 
appended to R. This only happens in case 1 of the procedure and thus y = x <\X[i]. Since X is scmiordered 
this implies that x < for i < j < \X\, and therefore y = x X[j] ior i < j < \X\. By the above 
invariant it follows that y = x X[j] for 1 < j < i — 1. Thus if y e Deep(X) then $z G X such that y -< z. 

Let y G X be an element such that X n V{T{y)) — {y}. Let j be the smallest index such that X[j] — y. 
When comparing y and x during the iteration where i = j we are in case 1 or 2, since j is the smallest index 
such that X[j] = y (implying x ^ y) and X fl V{T{y)) = {y} (implying y x). In either case x is set to y. 
Since there are no descendants of y in X, the variable x remains equal to y until added to R. If y occurs 
several times in X we will have x = y each time we meet a copy of y (except the first) and it follows from 
the implementation that y will occur exactly once in R. □ 



To show that the implementation of MopRlGHT is correct we will use the following proposition. 

Proposition 3 Before the first iteration of the for loop we have y = Y[l]i, x = X[j] and either X[j — 1] < 

y[l]2 < X[j] (tfj > 1) or Y[l]2 < X[j] if (j = 1). 

At the end of each iteration of the for loop then, unless Y[i]2 </\X^X\\, we have y = Y[i]i, x = X[j] and 
either X\j - 1] < Y\i]2 < X[j] (if j > 1) or Y[i]2 < X\j] if (j = 1). 

Proof The first statement (y = y[l]i, x = X[j] and either X[j - 1] < Y[l]2 < X[j] (if j > 1) or Y[l]2 < X[j] 
if (j = 1)) follows immediately from the implementation of the procedure line 2 and 3 ("Find the smallest j 
such that ..."). 

We prove the second statement by induction on i. Base case i = 2. By the first statement we have 
y = X = X[j] and cither X[j - 1] <] Y[l]2 < X[j] (if j > 1) or Y[l]2 < X[j] if (j = 1) before this 

iteration. Let j' be the value of j before this iteration. It follows immediately from the implementation that 
y = Y[2]i since y is set to this in both case 1 and 2. If Y[2]2 <\ then j — j'. Since Y is ordered it 
follows that X[j - 1] < Y[l]2 O Y[2]2 O X[j] (if j > 1) or Y[2]2 < X[j] if (j = 1). If Y[2]2 i6X[j'] then j is 
increased until Y[2]2 < X[j] implying X[j — 1] <\ Y[2]2 < X[j] unless j > \X\, since X is ordered. 

Induction step i > 2. It follows immediately from the implementation that y = Y[i]i since y is set to 
this in both case 1 and 2. By the induction hypothesis we have y = Y[i]i, x = X[j] and Y[i]2 < X[j] right 
before this iteration. Let j' be the value of j before this iteration. If Y[i]2 < X[j'] then j = j' . Since Y is 
ordered it follows that X[j ^ 1] < Y[i - Ija < Y\i]2 <iX[j] (if j > 1) or Y\i]2 < X[j] if (j = 1). If Y\i]2 ^X[j'] 
then j is increased until Y[i]2 < X\j] implying ^[j — 1] < F[i]2 < X\j] unless j > □ 



Lemma 8 Procedure MopRight(F, X) is correct. 

Proof. We want to show that for any 1 < i' < \Y\, 1 < j' < \X\: 

{Y[i'],,X\j'])eR ^ {Y[i']2,X\j'])&mop{Y\2,X) 



Since Y\2 and X are ordered lists we have 



{Y[i']2,X\j']) e mop(y|2,X) ^ X[j' - 1] < y[i']2 < X\j'] < Y[i' + 1]2 , (3) 
for i' < \Y\ and j' > 2. We have three border cases 



{Y[i']2,X[f])&mop{Y\2,X) ^ { 



' X[f -l]<Y[i']2<\X[f], fori'=|y| andj'>2 
Y[i']2 <i X[j'] <Y[i' + 1]2, for i' < |y| and j' = 1 (4) 
Y[i']2 < X[j'], for i' = \Y\ and f = 1 
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Thus to show {Y[i']i,X[j']) eR^ {Y[i']2, X[j']) £ mop{Y\2,X) it is enough to show {Y[i']i, X[j']) G i? ^ 
X[j' - 1] < Y[i']2 < X[j'] < Y[i' + 1]2 (plus the border cases). 

We start by showing that if the right hand side of Equation [3] or [H i.e., (F[i']2, £ mop(F|2,^), is 

satisfied then e R. We spht the proof into two cases depending on the value of j': 

• j' = 1: Since the list Y\2 is ordered, x is set to X[l] in line 2 ("Find the smallest j such that . . . ") and 
y is set to Now x and remains the same when we get to the step in the iteration when i = i' in 
the for loop. Thus y is set to Y[i']i (case 1). There are three subcases: 

— i' — 1: Then y is already set to 

— i' = \Y\: Then we end the for loop after this iteration and return R := Ro(y, x) — Ro(Y[i']i, X[l]). 

— i' < \Y\: Then j is increased in the next iteration since X[j] <Y[i' + l]2- 

Therefore {y,x) = (y[i']i, is added to R (case 1 of the procedure or the stop condition in line 2 

in the for loop ("If j > \X\ stop and return R Ro (y, a;).")). 

• j' > 2: Since X is ordered x < X[j'] after the execution of line 2 and 3 in the code ("Find the smallest 
j such that ..."). There are two subcases: 

— i' = 1: We now have x = X[j'] and y = 

— i' > 1: Consider the step in the iteration when i — i'. Since Y\2 and X are both ordered j < j'. 
Now either x = X[j'] (j = j') in which case we set y :— Y[i']i (case 2), or x <J X[j'] (j < j') in 
which case j is increased to j' since X[j' — 1] < Y[i'] O X[j']. In the latter case we set x :— X[j'] 
and y :— Y[i']i. 

We have now argued that after the iteration where i = i' we have x := X[j'] and y :— Y[i']i. li \ Y\ = i' 
then we return R = {y,x). If i' < \Y\ then j is increased in the next iteration since X[j'] < Y[i' + 1]2- 
It follows from the code that when j is increased (y, x) is added to R (either case 1 of the procedure 
or the stop condition in line 2 in the for loop ("If j > \X\ stop and return R -.^ Ro (y, a;).")). 

We will now show that (F e R implies one of the right hand sides of Equation [3] and ID Since 
(y[i']i, £ R we had {y,x) = (F[i']i, at some point during the execution. The pair {y,x) can be 

added to R only in the for loop before changing the values of y and x or at the execution of the last line of 
the procedure. Therefore {y,x) — {Y[i']i, X[j']) at the beginning of some execution of the for loop, or after 
the last iteration {i = \Y\). It follows by Proposition [3] that X[j - 1] <] Y[i]2 <X[j] if j > 1 or Y[i]2 <X[j] if 
j = 1. It remains to show that X[j'] < Y[i' + 1]2 for i' < \Y\. It follows from the implementation that (y, x) 
only is added to R inside the for loop if j is increased. Thus j was increased in the next iteration {i = i' + 1) 
implying X[j'] <] Y[i' + l]2. □ 



Lemma 9 Procedure MopLeft(X, F) is correct. 

Proof. Similar to the proof of Lemma [51 □ 
To show that Fl is correct we need the following proposition. 

Proposition 4 Let X be an ordered list and let x he an ancestor of X[i] for some i G {1, . . . , /c}. If x is an 

ancestor of some node in X other than X[i] then x is an ancestor of X[i — 1] or X[i + 1]. 

Proof. Recall that u<iv iff pre(M) < pre(w) and post(u) < post(u). Since x -< we have pre(a;) < pre(X[i]) 
and post([X[i]) < post(a:). Assume there exists a descendant X[j] of x such that j ^ {i — l,i,i + 1}. If 
J < « — 1 we have 

pre(a;) < pre(X[7]) < pre(X[i - 1]), 



15 



where the first incquaUty follows from x -< X[j] and the second from X being ordered. And 

post{X[i - 1]) < post{X[i]) < post(a;), 

where the first inequality follows from X being ordered and the second from x < X[i]. Thus x < X[i — 1]. 

Similarly, for j > i + 1, we have pre(a;) < pre{X[i]) < prc{X[i + 1]) and post{X[i + 1]) < post(X[j]) < 
post(x) implying that x ^ X[i + 1]. □ 

Proposition |4] shows that the doubly linked list implementation of Deep* is correct. Since all changes to 
the list are either deletions or insertions of a parent in the place of its child, the list L (and thus also Z) is 
ordered at the beginning of each iteration of the outer loop. 

Lemma 10 Procedure FL{X,a) is correct. 

Proof. Let F — {fl(x, a) \ x e X}. We first show that Fl{X, a) C F. Consider a node x G Fl{X, a). Since 
X is in L after the final iteration, x was deleted from Z during some iteration. Thus label(x) = a. For any 
y € X we follow the path from y to the root and stop the first time we meet a node with label a or even 
earlier since we keep the list deep. Thus x E F. 

It remains to show that DEEP(i^) C Fl{X, a). Let a; be a node in Deep(F), let z £ X be the node such 
that X = ii{z, a), and let z = xi,X2, ■ ■ . ,Xk — x be the nodes on the path from z to x. We will argue after 
each iteration of the algorithm we have Xi E L for some i. Since label(a;i) 7^ a for i < fc this is the same as 
Xi € Z for i < k. Before the first iteration we have xi di X = Z . As long as i < fc we replace Xi with x^+i in 
case 1 of the for loop, since label(a;i) ^ a. When i = k we remove Xk from Z but keep it in L. It remains 
to show that we do not delete Xi in the computation of Deep*(Z, L) in any iteration. If Xi is removed then 
there is a node z E L that is a descendant of Xi and thus also a descendant of x. We argued above that 
L\Z C F and thus z G Z since x € Deep(F). But since x G DEEP(i^) no node on the path from 2 to a; can 
have label a and therefore Xi will eventually be reinserted in Z. 

Since Fl{X, a) is a deep set, we have DEEP(i^) C Fl{X, a) C (F) ^ DEEP(i^) = Fl(X, a). □ 



4.4 Complexity of the Set Procedures 

For the running time of the node list implementation observe that, given the data structure described in 
Section [4.11 set procedures, except Fl, perform a single pass over the input using constant time at each 
step. Hence we have, 

Lemma 11 For any tree T there is a data structure using O^ut) space and preprocessing which supports 
each of the procedures Parent, Deep, MopRight, MopLeft, and NCA in linear time (in the size of their 
input). 

The running time of a single call to Fl might take time 0{nT)- Instead we will divide the calls to Fl into 
groups and analyze the total time used on such a group of calls. The intuition behind the division is that 
for a path in P the calls made to Fl by Emb are done bottom up on disjoint lists of nodes in T. 

Lemma 12 For disjoint ordered node lists Xi, . . . , Xk and labels ai, . . . , ak, such that any node in Xi^i is 
an ancestor of some node in FLT{Xi,ai), I < i < k, all of FLT{Xi,ai), . . . ,FLT{Xk,ak) can be computed 
in 0{nT) time. 

Proof. Let Z and L be as in the implementation of the procedure. Since Deep* takes time 0(|-^|) and 
each of the steps in the for loop takes constant time, we only need to show that the total length of the lists 
Z — summed over all the calls — is 0{nT) to analyze the total time usage. We will show that any node in T 
can be in Z at the beginning of the for loop at most twice during all calls to Fl. 

Consider a single call to Fl. Except for the first iteration, a node can be in Z only if one of its children 
were in Z in the last iteration. Note that Z is ordered at the beginning of each for loop. Thus if a node is 
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in Z at the beginning of the for loop none of its children arc in Z and thus in one call to Fl a node can be 
in Z only once. 

Look at a node z the first time it appears in Z at the beginning of an execution of the for loop. Assume 
that this is in the call FL(Xi, a^). If z G Xi then z cannot be in Z in any later calls, since no node in Xj 
where j > « can be a descendant of a node in Xi. If label(2) 7^ then z cannot be in Z in any of the later 
calls. To see this consider the time when z is removed from Z (case 2 or case 3). Since the set L is deep at 
the beginning of the for loop and Z C L, no descendant of z will appear in Z later in this call to Fl, and no 
node in the output from FL(Xi, oii) can be a descendant of z. Since any node in Xj, j > z, is an ancestor of 
some node in FL(Xi, a^) neither z or any descendant of z can be in any Xj, j > i. Thus z cannot appear in 
Z in any later calls to Fl. Now if label(z) — ai then we might have z € Xi^i. In that case, z will appear 
in Z in the first iteration of the procedure call FL{Xi+i,ai), but not in any later calls since the lists are 
disjoint, and since no node in Xj where j > i + 1 can be a descendant of a node in Xi^i. If label(2;) = ai 
and z ^ Xij^i then clearly z cannot appear in Z in any later call. Thus a node in T is in Z at the beginning 
of an execution of the for loop at most twice during all the calls. □ 



4.5 Complexity of the Tree Inclusion Algorithm 

Using the node list implementation of the set procedures we get: 

Lemma 13 For trees P and T the tree inclusion problem can he solved in 0{lpnx) time. 

Proof. By Lemma [TT] we can preprocess T in 0{nT) time and space. Let g{n) denote the time used by Fl 
on a list of length n. Consider the time used by EMB(root(P)). We bound the contribution for each node 
V e V{P). If z; is a leaf we are in case 1 of Emb. The cost of computing Fl(L(T), label(i))) is 0{g{lT)), and 
by Lemma [T^ (with fc = 1) we get 0{g{lT)) = 0{nT). Hence, the total cost of all leaves is OilpriT). If v has 
a single child w we are in case 2 of Emb, and by Lemma [TT] the cost is 0((7(|Emb(w)|)). If v has more than 
one child the cost of MopRight, Nca, and Deep is bounded by X]M,echiid(u) 0(|Emb(w)|). Furthermore, 
since the length of the output of MopRight (and thus Nca) is at most z = min,„g(,hiid(t)) |Emb(w)| the cost 
of Fl is 0{g{z)). Hence, the total cost for internal nodes is, 

J2 o(g{ niiri |Emb(«;)|)+ ^ |Emb(z«)|) = ^ 0{g{\EUB{v)\)) . (5) 
vev{P)\L(p) ^ tuechiid(D) ^ vev(p) 

Next we bound (O. For any w € child(w) we have that EMB(?i;) and Emb(w) are disjoint ordered lists. 
Furthermore we have that any node in Emb(u) must be an ancestor of some node in Fl(Emb(w), label(u)). 
Hence, by Lemma [T^ for any leaf to root path S = vi, . . . ,Vk in P, we have that J2u£S 3(|Emb(m)|) = ©(tt-t)- 
Let A denote the set of all root to leaf paths in P. It follows that, 

J2 5(|EMB(t;)|) < ^ ^g(|EMB(z.)|) = Oilpnr) • 
vev{T) peAuGp 

Since this time is the same as the time spent at the leaves the time bound follows. □ 

To analyze the space used by the algorithm we first bound the size of Emb(w) for each node v G V{P). We 
then use this to bound the total the size of embeddings stored in the recursion stack in the computation of 
Emb (root (P)), i.e., the total size of embeddings stored by recursive calls during the computation. 

Lemma 14 For any tree P we have \/v G V{P): 

It 

|Embt(w)| < 7-^ . 

I-P{v) 
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Proof. By Lemma [S] Emb(w) is the set of deep occurrences of P{v) in T. By the definition of deep the 
occurrences are disjoint and no node in one occurrence can be an ancestor of a node in another occurrence. 
Each occurrence has at least lp(^y) descendant leaves and each of these leaves is an ancestor of at least one 
distinct leaf in T (see also Figure m^b)). Thus the number of occurrences is bounded by It/Ip(v)- D 



Lemma 15 The total size of saved embeddings on the recursion stack at any time during the computation 
o/ EMB(root(P)) is at most 0{It)- 

Proof. Let node v be the node for which we are currently computing Emb. Let p be the path from the root 
to V and let wq, . . . ,wi be the light nodes on this path. We have t = ldepth(t;). There is one embedding 
on the stack for each light node on the path (see Figure : For the heavy nodes on the path there can be 
no saved embeddings in the recursion as the algorithm always recurses on the heavy child first. For each 
light node Wi on the path p except the root wq the stack will contain either EMB(heavy(parent(u;i))), or 
Uj = MopRight([/j_i, where Vj is WiS left sibling, or Uj = MopLEFT([7j_i, where Vj is WiS right 
sibling. The computation of Uj is a series of MopRight (or MopLeft) computations that started with the 
pair of node lists (EMB(heavy(parent(?i;i))), EMB(heavy(parent(u'i)))) as the first argument to MopRight 
(or MopLeft). As the output of MopRight (or MopLeft) can be no larger than the input to the procedure 
we have \Uj\ — 0(|EMB(heavy(parent(u'i))))| and thus the total space used at any time during the recursion 
is 

Idcpth(v) 

0( ^ |EMB(heavy(parent(wi)))|) . 

1=1 

By Lemma [HI we have 

|EMB(heavy(parent(?i'i)))| < 



^P(hcavy (parent 



and thus 

ldcpth('(;) Idcpth(i') ^ 

|EMB(heavy(parent(i«j)))| <It ^ . (6) 

.1^1 i^l 'P(heavy(parent(tOi))) 

By the definition of heavy the node heavy(parent(u'i)) has more leaves in its subtree than Wi, i.e., 

^P{wi) — ^P(hcavy(parcnt(ti!^))) ■ (7) 

Obviously, heavy (parent (ii;^)) has no more leaves in its subtree than its parent, i.e., 

(heavy (parent (tOi ))) — ^ P(pa,Tcnt{wi)) • (8) 

Since Wi is light it has at most half the number of leaves in its subtree as its parent, that is 

/2 . (9) 
Combining this with the fact that Wi is an ancestor of Wi^i and heavy(parent(it;^4_i)) we get, 

^P (heavy (parent (wj )) — ^P (parent (tUj )) 

by ® 

< lp(wj-i} since Wj-i is an ancestor of Wj 

< 'p(parent(tUj_i))/2 by ^ 

< /p(iu^._2)/2 since Wj-2 is an ancestor of parent(i(;j_i) 

< ^P(hcavy(paront(tUj_2)))/2j by ([7]) 
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Figure 6: Path from root to v. The heavy nodes are black and the Ught nodes are white. The heavy edges 
are the thick edges and the Hght edges are thin. 



for any 2 < j < ldepth(i;). Let li = ^p(hoavy(parcnt(tui))) for aU i. To bound the sum in ^ we wiU use that 
k < k-2/2, k < k-i, and ^idcpth(i,) > 1- We have 



Idcpth(u) ldcpth(^J) 



^— 1 2—2,2 odd 

since the liS in the last sum is decreasing with a factor of 2. Combining this with Equation ^ we get 

ldcpth(2j) ldcpth(t;) _^ 

|EMB(heavy(parent(wi)))| <It ^ -, < 4/t • 

1—1 'P(hcavy(paront(ii>i))) 



□ 



Theorem 2 For trees P and T the tree inclusion problem can be solved in 0{lpnT) time and O^ut) space. 

Proof. The time bound follows from Lemma fT3l Next consider the space used by EMB(root(P)). The 
preprocessing of Section |4T1 uses only 0{nT) space. By Lemma [TS] the space used for the saved embeddings 
is O {It) {tit). □ 



4.6 An Alternative Algorithm 

In this section we present an alternative algorithm. Since the time complexity of the algorithm in the previous 
section is dominated by the time used by Fl, we present an implementation of this procedure which leads to 
a different complexity. Define a firstlabel data structure as a data structure supporting queries of the form 
fi{v,a), V e V{T), a G S. Maintaining such a data structure is known as the tree color problem. This is 
a well-studied problem, see e.g. [5,16,18,31]. With such a data structure available we can compute Fl as 
follows. 
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FL{X,a): Return the list DEEP([fl(X[l], a), . . . , fi(X[|X|], a)]). 

Theorem 3 Let P and T he trees. Given a firstlabel data structure using s{nx) space, p{nT) preprocessing 
time, and q^nx) time for queries, the tree inclusion problem can be solved in 0{p{nT) + Iplr ■ qinr)) time 
and 0{s{nT) + JT-t) space. 

Proof. Constructing the firstlabel data structures uses 0{s{nT)) space and 0{p{nT)) time. The total cost 
of the leaves is bounded by 0{lplT ■ qinr)), since the cost of a single leaf is 0{It ■ qinr))- As in the proof 
of Theorem [2] we have that the total time used by the internal nodes is bounded by J2vev{p) 5(|E'Mb(?;)|), 
where g{n) is the time used by Fl on a list of length n, that is, g(n) < n-q{nT). By Lemma [TT] and Lemma [151 
for any leaf to root path 6 = vi, . . . ,Vk in P, we have that X^msiS |Emb(u)| < 0{It). Let A denote the set of 
all root to leaf paths in P. It follows that, 



Several firstlabel data structures are available, for instance, if we want to maintain linear space we have. 

Lemma 16 (Dietz [16]) For any tree T there is a data structure using 0{nT) space, O^ut) expected pre- 
processing time which supports firstlabel queries in O(loglognT) time. 

The expectation in the preprocessing time is due to perfect hashing. Since our data structure does not need 
to support efficient updates we can remove the expectation by using the deterministic dictionary of Hagerup 
et al. [20]. This gives a worst-case preprocessing time of 0(nT log rir)- However, using a simple two-level 
approach this can be reduced to 0{nT) (see e.g. [38]). Plugging in this data structure we obtain. 

Corollary 2 For trees P and T the tree inclusion problem can be solved in 0(1 plx log log + nx) time and 
OiriT) space. 

5 A Faster Tree Inclusion Algorithm 

In this section we present a new tree inclusion algorithm which has a worst-case subquadratic running time. 
As discussed in the introduction the general idea is to divide T into clusters of logarithmic size which we 
can efficiently preprocess and then use this to speed up the computation with a logarithmic factor. 

5.1 Clustering 

In this section we describe how to divide T into clusters and how the macro tree is created. For simplicity in 
the presentation we assume that T is a binary tree. If this is not the case it is straightforward to construct 
a binary tree B, where ns < Sn^, and a mapping g : V{T) — > V{B) such that for any pair of nodes 
v,w & V{T), label(w) ~ \a,he\{g{v)), v ^ w iS g{v) ~< g{w), and v < w iS g{v) <\ g{w). The nodes in the set 
U = V{B)\{g{v) I V G V{T)} are assigned a special label /3 ^ S. It follows that for any tree P, P C T iff 

pep. 

Let C be a connected subgraph of T. A node in V{C) adjacent to a node in V{T)\V{C) is a boundary 
node. The boundary nodes of C are denoted by SC. We have root(T) e SC if root(r) G V{C). A cluster 
of C is a connected subgraph of C with at most two boundary nodes. A set of clusters CS is a cluster 



partition of T iff V{T) = UceCS^(C), E{T) = \Jc^csE{C), and for any Ci, Ca G CS, E{Ci) n ^(Cz) = 0, 



|P(Ci)| > 1- If \SC\ = 1 we call C a leaf cluster and otherwise an internal cluster. 

We use the following recursive procedure Clustert(w, s), adopted from [6], which creates a cluster 
partition CS of the tree T{v) with the property that \CS\ = 0{s) and \V{C)\ < Inr/s'] for each C G CS. 
A similar cluster partitioning achieving the same result follows from [3,4, 19]. 



^ g{\EMB{v)\)< 




^ ^g(|EMB(u)|) < ^ 0{It ■ q{nT)) < 0{IpIt ■ qinr)). 



Since this time is the same as the time spent at the leaves the time bound follows. 



□ 
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CLUSTERT('y, s): For each child u of w there are two cases: 

1. |y(T(u))| + 1 < Ijit/sI . Let the nodes {v} U V{T{u)) be a leaf cluster with boundary 
node V. 

2. |F(T(w))| > Iut/s]. Pick a node w G V{T{u)) of maximum depth such that 
\vIt{u))\+2-\V{T{w))\ < [nr/sl. Let the nodes V {T {u))\V {T (w)) U {v , w} he an 
internal cluster with boundary nodes v and w. Recursively, compute CLUSTER7'(it;, s). 

Lemma 17 Given a tree T with tlt > 1 nodes, and a parameter s, where [tit/s] > 2, we can build a cluster 
partition CS in 0{nT) time, such that \CS\ = 0{s) and \V{C)\ < [n^/s] for any C G CS. 

Proof. The procedure CLUSTER7-(root(T), s) clearly creates a cluster partition of T and it is straightforward 
to implement in 0(nr) time. Consider the size of the clusters created. There are two cases for u. In case 

1, \V{T{u))\ + 1 < [nr/sl and hence the cluster C = {u} U V{T{u)) has size \V(C)\ < {ut/s']. In case 

2, \V{T{u))\ + 2- \V{T{w))\ < Iut/s] and hence the cluster C = V {T {u))\V {T (w)) U {v,w} has size 
\ViC)\<\nT/s]. 

Next consider the size of the cluster partition. Let c = [nr/s]. We say that a cluster C is bad if 
I^IC*)! < c/2 and good otherwise. We will show that at least a constant fraction of the clusters in the cluster 
partition are good. It is easy to verify that the cluster partition created by procedure Cluster has the 
following properties: 

(i) Let C be a bad internal cluster with boundary nodes v and w {v ^ w). Then w has two children with 
at least c/2 descendants each. 

(ii) Let C be a bad leaf cluster with boundary node v. Then the boundary node v is contained in a good 
cluster. 

By (ii) the number of bad leaf cluster is at most twice the number of good internal clusters and by (i) each 
bad internal cluster has two child clusters. Therefore, the number of bad internal clusters is bounded by the 
number of leaf clusters. Let bi and gi denote the number of bad and good internal clusters, respectively, and 
let bi and gi denote the number of bad and good leaf clusters, respectively. We have 

bi < h +gi < 2gi+gi, 

and therefore the number of bad clusters is bounded by 

bi+bi < 2gi + gi + 2gi = Agi + gi . 

Thus the number of bad clusters is at most 4 times the number of good clusters, and therefore at most a 
constant fraction of the total number of clusters. Since a good cluster is of size more than c/2, there can be 
at most 2s good clusters and thus \CS\ = 0{s). □ 

Let C € CS be an internal cluster with v,w € SC. The spine path of C is the path between v, w excluding 

V and w. A node on the spine path is a spine node. A node to the left and right of v or of any node on 
the spine path is a left node and right node., respectively. If C is a leaf cluster with v G SC then any proper 
descendant of w is a leaf node. 

Let CS he a cluster partition of T as described in Lemma 1171 We define an ordered macro tree M. 
Our definition of M may be viewed as an "ordered" version of the macro tree defined in [6]. The node set 
V{M) consists of the boundary nodes in CS. Additionally, for each internal cluster C € CS with v,w G 5C, 

V ^ w, we have the nodes s{v,w), l{v,u!) and r{v,w) and edges {v, s(v,w)), {s{v,w),l{v,w)), (s{v,w),w), 
and {s{v,w),r{v,w)). That is, the nodes l{v,w), r{v,w) and w are all children of s{v,w). The nodes are 
ordered so that l{v,w) <\ w <\ r{v,w). For each leaf cluster C, v G SC, we have the node l{v) and edge 
{v,l{v)). Since root(T) is a boundary node, M is rooted at root(T). Figure [7] illustrates these definitions. 
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Figure 7: The clustering and the macro tree, (a) An internal cluster. The black nodes are the boundary 
nodes and the internal ellipses correspond to the boundary nodes, the right and left nodes, and spine path, 
(b) The macro tree corresponding to the cluster in (a), (c) A leaf cluster. The internal ellipses are the 
boundary node and the leaf nodes, (d) The macro tree corresponding to the cluster in (c). 



With each node v G V{T) we associate a unique macro node denoted c{v). Let u G V{C), where C G CS. 



Conversely, for any macro node i G V{M) define the micro forest, denoted C{i), as the induced subgraph 
of T of the set of nodes {v \ v G V{T),i = c{v)}. We also assign a set of labels to i given by label(i) = 
{label(?;) | v G V{C{i))}. If j is a spine node or a boundary node the unique node in V{C{i)) of greatest 
depth is denoted by first(j). Finally, for any set of nodes {ii, . . . , ifc} C V{M) we define C(ii, . . . , ik) as the 
induced subgraph of the set of nodes V{C{ii)) U • • • U V{C{ik))- 

The following propositions state useful properties of ancestors, nearest common ancestor, and the left- 
to-right ordering in the micro forests and in T. The propositions follow directly from the definition of the 
clustering. See also Figure [8l 

Proposition 5 (Ancestor relations) For any pair of nodes v,w G V{T), the following hold 
(i) If c{v) = c{w) then v <t w iff v <c(c(v)) w- 

(a) Ifc{v) 7^ c{w), and for some boundary nodes v' , w' we have c{v) — s{v',w'), andc(w) G {l(v' ,w'),r(v' ,w')} , 
then V <TW iff v <c(c(w),s(v' ,w'),v') w. 

(Hi) In all other cases, v w iff c{v) c{w). 



l{v) 

c{u) = < s{v, w) 




u 



if u is boundary node, 

if M is a leaf node and v G 5C, 

if M is a spine node, v,w G dC, and v ^ w, 

if M is a left node, v,w G dC, and v ^ w, 

if M is a right node, VjW G dC, and v ^ w. 
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Figure 8: Examples from the propositions. In all cases v' and w' are top and bottom boundary nodes of the 
cluster, respectively, (a) Proposition [Sljii). Here c{v) — s{v',w') and c{w) = l{v',w') (solid ellipses). The 
dashed ellipse corresponds to C{c{w), s{v' ,w'),v'). (b) Proposition[S]Ji) and[7I^ii). Here c{v) = c{w) — l{v' , w') 
(solid ellipse). The dashed ellipse corresponds to C{c{w), s{v' ,'w'),v'). (c) Proposition [6Kii) and[7li). Here 
c{v) = c{w) = l{v') (solid ellipse). The dashed ellipse corresponds to C{c{v),v'). (d) Proposition[6Kiii). Here 
c{v) = l{v',w') and c{w) — s{v',w') (solid ellipses). The dashed ellipse corresponds to C{c{v),c{w),v'). (e) 
Proposition miv). Here c{v) = s{v',w') and c{w) = r{v',w') (solid ellipses). The dashed ellipse corresponds 
to C{c{v),c{'w),v'). (f) Proposition [TKiv) . Here c{v) ~ l{v',w') and c{w) = r{v',w') (solid ellipses). The 
dashed ellipse corresponds to C{c{v),c{w),s{v' ,w'),v'). (g) Proposition [7jv). Here c{v) = l{v',w') (solid 
ellipse) and w' :<m c{w). The dashed ellipse corresponds to C(c(w), s(w', w'), w', w')). 



Case (i) says that if v and w belong to the same macro node then v is an ancestor of w iff w is an ancestor 
of w in the micro forest for that macro node. Case (ii) says that if w is a spine node and w is a left or right 
node in the same cluster then v is an ancestor of w iff w is an ancestor of w in the micro tree induced by 
that cluster (Figure E^a)). Case (iii) says that in all other cases v is an ancestor of w iff the macro node v 
belongs to is an ancestor of the macro node w belongs to in the macro tree. 

Proposition 6 (Left-of relations) For any pair of nodes v,w ^ the following hold 

(i) Ifc{v) = ciw) G {r{v' , w'), l{v', w')} for some boundary nodes v' , w' , then v<lw i/f f <c(c(D),t)^s(tI',to'))W'• 
('^^j If c{v) — c{w) — l{v') for some boundary node v' , then v <\w iff v <c(c(v),v') 

(iii) If c[v) — l{v',w') and c{w) — s(v',w') for some boundary nodes v',w' , then v<iw ijj v <lc(^c{v).c(w),v')^ ■ 

(iv) Ifc{v) = s{v',w') and c{w) = r(u', w') for some boundary nodes v' ,w' , then v<\w iff v<\c{c{v),c{w),v')W. 
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(v) In all other cases, v <i w iff c{v) <Im c(w). 

Case (i) says that if v and w are both either left or right nodes in the same cluster then v is to the left of w iff 

V is to the left of w in the micro tree induced by their macro node together with the spine and top boundary 
node of the cluster (Figure [5]^b)). Case (ii) says that if v and w are both leaf nodes in the same cluster then 

V is to the left of w iff w is to the left of w in the micro tree induced by that leaf cluster (Figure [S^c)). Case 
(iii) says that if w is a left node and w is a spine node in the same cluster then v is to the left of w iff w is to 
the left of w in the micro tree induced by their two macro nodes and the top boundary node of the cluster 
(Figure [81(d)). Case (iv) says that if w is a spine node and w is a right node in the same cluster then v is to 
the left of w iff ti is to the left of w in the micro tree induced by their two macro nodes and the top boundary 
node of the cluster (Figure [8je)). In all other cases v is to the left of w if the macro node v belongs to is to 
the left of the macro node of w in the macro tree (Case (v)). 

Proposition 7 (Nca relations) For any pair of nodes v,w E V{T), the following hold 

(i) If c{v) = c{w) = l{v') for some boundary node v' , then ncaT(f,w) = Ticai,c{c{v},v'){v,w). 

(ii) If c{v) — c{w) € {l{v' ,w'),r{v' ,w')} for some boundary nodes v',w', then 
ncariv^w) = ncac(c(t,),s(i,',to'),t>')(^' 

(iii) If c{v) — c{w) — s{v',w') for some boundary nodes v' ,w' , then ncaT(t', w) = nca(7(f,(^'))(u, w). 

(iv) If c{v) ^ c{w) and c{v), c{w) E {I {v' ,w'),r{v' ,w'), s{v' ,w')} for some boundary nodes v' ,w' , then 
ncariv^w) = n-ca,c(c(v),c{w).s{v' ,w'),v')iv, w) . 

(v) If c{v) 7^ c{w), c{v) e {l{v' ,w'),r{v' ,w'), s{v' ,w')}, and w' <m c{w) for some boundary nodes v',w', 
then ncariv^w) = iicac(c{v),s{v' ,w'},v' ,w')iv,w'). 

(vi) In all other cases, iicaTiv,w) — iica7\/(c(w), c(w)). 

Case (i) says that if v and w are leaf nodes in the same cluster then the nearest common ancestor of v and w 
is the nearest common ancestor of v and w in the micro tree induced by that leaf cluster (Figure [HKc)). Case 
(ii) says that if v and w are both either left nodes or right nodes then the nearest common ancestor of v and 
w is the nearest common ancestor in the micro tree induced by their macro node together with the spine 
and top boundary node of the cluster (Figure [5Kb)). Case (iii) says that if v and w are both spine nodes in 
the same cluster then the nearest common ancestor of v and w is the nearest common ancestor of v and w in 
the micro tree induced by their macro node. Case (iv) says that if v and w are in different macro nodes but 
are right, left, or spine nodes in the same cluster then the nearest common ancestor of v and w is the nearest 
common ancestor of v and w in the micro tree induced by that cluster (we can omit the bottom boundary 
node) (Figure [HJf)). Case (v) says that if f is a left, right, or spine node, and the bottom boundary node 
w' of w's cluster is an ancestor in the macro tree of the macro node containing w, then the nearest common 
ancestor of v and w is the nearest common ancestor of v and w' in the micro tree induced by the macro 
node of v, the spine node, and the top and bottom boundary nodes of v's cluster (Figure [Sj^g)). In all other 
cases the nearest common ancestor of v and w is the nearest common ancestor of their macro nodes in the 
macro tree (Case (vi)). 

5.2 Preprocessing 

In this section we describe how to preprocess T. First build a cluster partition CS of the tree T with clusters 
of size s, to be fixed later, and the corresponding macro tree M in 0{nT) time. The macro tree is preprocessed 
as in Section |4?T] However, since nodes in M contain a set of labels, we now store a dictionary for label(w) 
for each node v G V{M). Using the deterministic dictionary of Hagerup et al. [20] all these dictionaries can 
be constructed in 0{nx^ognx) time and 0{nT) space. Furthermore, we extend the definition of fl such that 
Q.m{v, a) is the nearest ancestor w oi v such that a £ label (w). 

Next we show how to preprocess the micro forests. For any cluster C £ CS, deep sets X,Y,ZC V{C), 
i G N, and a € S define the following procedures. 
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SIZe(X): 



Return the number of nodes in X . 



LEFT(i, X): 



Return the leftmost i nodes in X. 



RlGHT(i, X): 



Return the rightmost i nodes in X. 



leftof(X, y): Return aU nodes of X to the left of the leftmost node in Y. 
rightof(X, y): Return all nodes of X to the right of the rightmost node in Y. 

match(X, y, Z), where X = {mi < • • • < TOfc}, F = {ui < • • • <1 Wfc}, and Z C F. Return R :— {mj \ v-j £ Z}. 



If we want to specify that a procedure applies to a certain cluster C we add the subscript C. In addition to 
these procedures we also define the set procedures on clusters, that is, parent, nca, deep, and fl, as in 
Section [31 Collectively, we will call these the cluster procedures. We represent the input and output sets in 
the procedures as bit strings indexed by preorder numbers. Specifically, a subset X in a cluster C is given 
by a bit string bi . . .bg, such that bi = 1 iff the ith node in a preorder traversal of C is in X. If C contains 
fewer than s nodes we leave the remaining values undefined. 

The procedure SIZe(X) is the number of ones in the bit string. The procedure LEFT{i, X) corresponds 
to setting all bits in X larger than the ith set bit to zero. Similarly, RiGHT(i, X) corresponds to setting all 
bits smaller than the ith largest set bit to zero. Similarly, the procedures leftof(X, F), rightof(X, F), 
MOP(X, Y), and match (X, Y, Z) only depend on the preorder of the nodes and thus only on the bit string 
and not any other information about the cluster. 

Next we show how to implement the cluster procedures efficiently. We precompute the value of all 
procedures, except fl, for all possible inputs and clusters. By definition, these procedures do not depend 
on any specific labeling of the nodes in the cluster. Hence, it suffices to precompute the value for all rooted, 
ordered trees with at most s nodes. The total number of these is less than 2^^ (consider e.g. an encoding 
using balanced parenthesis). Furthermore, the number of possible input sets is at most 2*. Since at most 
3 sets are given as input to a cluster procedure, it follows that we can tabulate all solutions using less 
than 2'^'* • 2^"* = 2^"* bits of memory. Hence, choosing s < 1/10 logn we use 0(22'°sn) _ 0{^Jrl) bits. 
Using standard bit wise operations each solution is easily implemented in 0{s) time giving a total time of 
0[^/n\ogn). 

Since the procedure fl depends on the alphabet, which may be of size ny, we cannot efficiently apply 
the same trick as above. Instead define for any cluster C E CS, X C V{C), and a G S: 

ANCESTOr(X): Return the set {x | a; is an ancestor of a node in X}. 

EQc(a): Return the set {x \ x G V{C), label(a;) — a}. 

Clearly, ancestor can be implemented as above. For EQp note that the total number of distinct labels in 
C is at most s. Hence, EQp can be stored in a dictionary with at most s entries each of which is a bit string 
of length s. Thus, (using again the result of [20]) the total time to build all such dictionaries is 0{nT log nx)- 
By the definition of fl we have that. 



Since intersection can be implemented using a binary and-operation, flc{X, a) can be computed in constant 
time. Later, we will also need to compute union of sets represented as bit strings and we note that this can 
be done using a binary or-operation. 

To implement the set procedures in the following section we often need to "restrict" the cluster procedures 
to work on a subtree of a cluster. Specifically, for any set of macro nodes {ii, . . . , ik} in the same cluster C 
(hence, k < 5), we will replace the subscript C with C{ii, . . . , ik)- For instance, parenT(7(s(^ ,„) ,„))(X) = 
{parent(x) \ x & X Ci V{C{s{v,w),l{v,w))} n V{C{s{v,w),l{v,w)). To implement all restricted versions of 



MOP{X, Y) 



Return the pair (i?i, R2), where i?i — mop(X, Y)\-^ and R2 = mop(X, Y)\2. 



FLc(-'^, a) = deepc(ancestorc(-'^) nEQc(a)). 
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the cluster procedures, we compute for each cluster C E CS a bit string representing the set of nodes in each 
micro forest. Clearly, this can be done in 0{nT) time. Since there are at most 5 micro forests in each cluster 
it follows that we can compute any restricted version using an additional constant number of and-operations. 

Note that the total preprocessing time and space is dominated by the construction of deterministic 
dictionaries which use 0(nT log ny) time and 0{nT) space. 

5.3 Implementation of the Set Procedures 

Using the preprocessing from the previous section we show how to implement the set procedures in sublinear 
time. First we define a compact representation of node sets. Let T be a tree with macro tree M . For 
simplicity, we identify nodes in M with a number almost equal to their preorder number, which we denote 
their macro tree number: All nodes nodes except spine and left nodes are identified with their preorder 
number. Spine nodes are identified with their preorder number + 1 if they have a left node as a child and 
with their preorder number otherwise, and left nodes are identified with their preorder number - 1. Hence, 
we swap the order of left and spine nodes in the macro tree numbering. We will explain the reason for using 
macro tree numbers below. 

Let S C V{T) be any subset of nodes of T. A micro-macro node array (abbreviated node array) X 
representing S is an array of size um- The ith entry, denoted X[i], represents the subset of nodes in C(i), 
that is, X\i] — V^(C(i)) n S. The set X[i] is encoded using the same bit representation as in Section [5721 By 
our choice of parameter in the clustering the space used for this representation is 0{nT / lognx). 

We can now explain the reason for using macro tree numbers to identify the nodes instead of preorder 
numbers. Consider a node array representing a deep set. If a left node and the corresponding spine node 
both are non-empty, then all nodes in the left node are to the left of the node in the spine node. Formally, 

Proposition 8 Consider a node array X representing a deep set X . For any pair of nodes v,w E X , such 
that V e X[i] and w G X[j], i ^ j, we have 

V <i w <^ i < j . 

Proof. By Proposition[6l^v) the claim is true for i <ij. The remaining cases are i = l{v' , w') and j = s{v' , w') 
(Proposition [6] (iii)) and i = s{v',w') and j = r{v',w') (Proposition [6l[iv)). In both cases i < j and it follows 
immediately that v <\w ^ i < j. For the other direction, it follows from the structure of the macro tree that 
in both cases either v <\w w < v. But X is deep and thus v <iw. □ 

Thus, by using macro tree numbers we encounter the nodes in X according to their preorder number in the 
original tree T. This simplifies the implementation of all the procedures except Deep, since they all get deep 
sets as input. 

We now present the detailed implementation of the set procedures on node arrays. As in Section [4] we 
assume that the input to all of the procedures, except Deep, represent a deep set. Let X be a node array. 

Parent(X): Initialize an empty node array R of size um {R[t] '■— for i = 1, . . . um) and set i := 1. 
Repeat until i > um'- 

While X[i\ = do i := i + 1. 

There are three cases depending on the type of i: 

1. i <E {l{v,w),r{v,w)}. Compute 

For each j e {i, s(u, w), w}, set R[j] := R[j] U {N r\V{C{i))). 

2. i — l{v). Compute 

N := PARENTc(t^v}{X\i]) . 

For each j G {i,v}, set R[j] := R[j] U {NnV{C{j))). 
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3. i ^ {l{v,'w),r{v,w),l{v)}. Compute 

N := PARENTc(i){X[{\) . 

If N ^ <D set R[i\ := R[i] U N. Otherwise, if j := parentM(i) -L set R[j] := 
R\j] U {first(i)}. 

Set i:=i + l. 
Return R. 

Procedure Parent has three cases. Case 1 handles the fact that left or right nodes may have a node on a 
spine or the top boundary node as parent. Since no left or right nodes can have their parent outside their 
cluster there is no need to compute parents in the macro tree. Case 2 handles the fact that a leaf node may 
have the boundary node as parent. Since no leaf node can have its parent outside its cluster there is no need 
to compute parents in the macro tree. Case 3 handles boundary and spine nodes. In this case there is either 
a parent within the micro forest or we can use the macro tree to compute the parent of the root of the micro 
tree. Since the input to Parent is deep we only need to do one of the two things. If the computation of 
parent in the macro tree returns a nonempty set, this set is added to the output. Otherwise (the returned 
set is empty), we compute parent of i in the macro tree. If the computation of parent in the macro tree 
returns a node j, this will either be a spine node or a boundary node. To take care of the case where j is a 
spine node, we add the lowest node (first(j)) in j to the output. If j is a boundary node this is just j itself. 

We now give the implementation of procedure NcA. The input to procedure NcA is two node arrays X 
and Y representing two subsets X,y C V{T), \X\ = \y\ = k. The output is a node array R representing 
the set DEEP({nca(A'j, | 1 < i < k}), where and J^, is the ith element of X and y, wrt. their preorder 
number in the tree, respectively. We also assume that we have Xi <\ yi for all i (since NcA is always called on 
a set of minimum ordered pairs). Note, that Xi and yi can belong to different clusters/nodes in the macro 
tree, i.e., we might have Xi € X[i] and yi £ Y[j] where i j. 

Nca(X, Y): Initiahze an empty node array R of size Hm, set i := 1 and j := 1. 
Repeat until i > hm or j > um- 

While =0 doi :=i + l. 
While Y[j] =0 do j := j + 1. 

Set n := min(siZE(X[i]), SlZE(F[j])), Xi := LEFT(n, X[i]), and Yj := LEFT(n, F [j]). 
Compare i and j. There are two cases: 
1. i = j. Set 




C{i,v), ifi = l{v), 

C{i, s{v, 'w),v), if i G {l{v, w),r{v, w)}. 



Compute A'' := NCAs{Xi, Yj). 

For each macro node hin S set R[h] := R[h] U {N n V{C{h))). 
2. I 7^ j. Compute h :— NCAM(«,i)- There are two subcases: 

(a) /i is a boundary node. Set R[h] := 1. 

(b) /i is a spine node s{v,w). There are three cases: 

i. i € {^(w, w), s{v, w)} and j G {s{v, w),r{v, w)}. 
Compute N := NCAc(i,j,siy,w),v){Xi,Yj). 

ii. i = l{v, w) and w ^ j. 

Compute N := NGAc(i,s(v,w),v,w){R^G}iT{l, Xi) , w) . 

iii. j = r{v, w) and w <i. 

Compute N := NCAc(jXv^^)^w^v){w,hEFT{l,Yj)). 
Set R[h] := R[h] U (iV n V{C{h))) and R[v] := R[v] U {N D V{C{v))). 
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Set X[i] \ X, and Y[j] := Y[j] \ Yj. 

Return DEEP(i?). 

In procedure NcA we first find the next non-empty entries in the node arrays X[i\ and Y[j]. We tlicn 
compare the sizes of X[i] and Y[j] and construct two sets of equal sizes Xi and Yj consisting of the 
min(siZE(X[i]), SiZE(y [j])) leftmost nodes from X[i] and Y[j]. In Section [5T^ we prove the following in- 
variant on Xi and Yj 

LEFT(l,Xi) = Xi and left(1, Yj) = yi for some I . 

The procedure has two main cases. If z = j then i is either a leaf, left, or right node due to the invariant and 
the assumption on the input that Xi <J yi (for a formal proof see Section [5^ . If i is a leaf node the nearest 
common ancestors of all pairs in Xi and Yj are in the leaf node or the boundary node. If i is a left or right 
node the nearest common ancestors of all the pairs are in i, on the spine, or in the top boundary node. In 
Case 1 we compute NCA in the appropriate cluster depending on the type of i. 

li i j we first compute the nearest common ancestor h oi i and j in the macro tree (Case 2). Due to 
the structure of the macro tree h is either a spine node or a boundary node (left, right, and leaf nodes have 
no descendants). If /i is a boundary node all pairs in Xi and Yj have the same nearest common ancestor, 
namely h (Case 2(a)). If ft, is a spine node there are three cases depending on the types of i and j. In 
Case 2(b)i i — l{v,w) and j £ {s{v,'w),r{v,w)} (see Figure [5{d) and (f)), ov i — s{v,w) and j = r{v,w) 
(see Figure |SJe)). In this case we compute NCA in the cluster containing i, j, s{v,w),v. In case 2(b)ii i is 
a left node l{v,w) and j is a (not necessarily proper) descendant of w (see Figure [IKg)). In this case we 
compute NCA on the rightmost node in Xi and w in the cluster containing i,v,w, s^VjW). We can restrict 
the computation to RiGHT(l,Xi) because we always run Deep on the output from NcA before using it in 
any other computation and all nearest common ancestors of the pairs in Xi and Yj will be on the spine, and 
the deepest one will be the nearest common ancestor of the rightmost nodes in Xi and Yj (see Section 15.41 
for a formal proof). Case 2(b)iii is similar to Case 2(b)ii. 

In the end of the iteration we have computed the nearest common ancestors of all the pairs in Xi and 
Yj and the nodes from these are removed from X[i] and Y[j]. 

The implementation of Deep resembles the previous implementation, but takes advantage of the fact that 
the input list is in macro tree order. 

Deep(X): Initialize an empty node array R of size um- 

Find the smallest j such that X[j] ^ 0. If no such j exists stop. Set i := j + 1. 
Repeat until i > nm: 

While X\i] = set i := i 1. 

Compare j and i. There are three cases: 

1. j <\ i. Set 

(CU,v), ifj- = /(«), 

5* S C{j,s{v,w),v), if j e {l{v,w),r{v,w)}, 
[C(j), otherwise. 

Set R[j] DEEPs(X[j]). 

2. j < i. There are two cases. 

(a) j = s{v,w) and i = r{v,w). Compute 

N BEEPc(r(v,w),s{v,w),v){X\i] U X[j]) . 

Set R[j] :=X[j]n7V. 

(b) All other cases. Do nothing. 
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3. i -< j (can happen if i = s{v,w) and j = l{v,w)). Compute 

N := DEEPc(i(„,^),s(^,™),„)(X[i] UX[i]) . 

Set R[j] := X[j] D iV, X\i] := X[i] n N. 
Set j := i and i := i + 1. 

Set R[j] := DEEPs{X[j]), where S is set as in Case 1. 
Return R. 

The procedure Deep has three cases. In case 1 node i is to the right of our "potential output node" j. Since 
any node I that is a descendant of j must be to the left oi i {I < i) it cannot appear later in the list X than 
i. We can thus safely add DEEP5(X[j]) to R at this point. To ensure that the cluster we compute Deep on 
is a tree we include the top boundary node if j is a leaf node and the top and spine node if j is a left or 
right node. We add the result to R and set i to be our new potential output node. In case 2 node j is an 
ancestor of i and therefore no node from C{j) can be in the output list unless j is a spine node and i is the 
corresponding right node. If this is the case we compute Deep of X[j] and X[i] in the cluster containing 
i and j and add the result for j to the output and set i to be our new potential output node. In case 3 
node i is an ancestor of j. This can only happen if j is a left node and i the corresponding spine node. We 
compute Deep of X[j] and X[i\ in the cluster containing i and j and add the result for j to the output. 
We also set X [i] to result and let i be our potential output node. The reason for this is that there might be 
nodes later in the input that are descendants of i, but since a left node has no children we can safely add 
the result for j to the output. After scanning the whole node array X we add the last potential node j to 
the output after computing Deep of it as in Case 1. 

We now give the implementation of procedure MopRight. Procedure MopRight takes a pair of node ar- 
rays (X, Y) and another node array Z as input. The pair [X, Y) represents a sot of minimum ordered pairs, 
where the first coordinates are in X and the second coordinates are in F. To simplify the implementation 
of procedure MopRight it calls two auxiliary procedures MopSim and Match defined below. Procedure 
MopSim computes mop of Y and and procedure Match takes care of finding the first-coordinates from 
X corresponding to the first coordinates from the minimum ordered pairs from M. 

MopRight((X, F), Z) Compute M := MopSiM(y, Z). 

Compute R := Match(X, F, My. 
Return (iJ.MIa). 

Procedure MopSim takes two node arrays as input and computes mop of these. 

MopSim(X, y) Initialize two empty node arrays R and S of size rm, set i := 1, j := 1, (ri,r2) := (0,0), 
(si, S2) (0,0). Repeat the following until i > um or j > um- 

While X[i] =0 set i := i + 1. There are 4 cases: 

1. i = l{v, w) for some v, w. Until Y\j] ^ and either i<\j, i = j, or j = s{v, w): set 
J ■■= j + 1- 

II. i = .s{v, w) for some v,'W. Until Y[j] ^ and either i < j ov j = r{v,iu): set 

III. i € {r{v,w),l(v)} for some v,w. Until Y[j] ^ and either i <i j or i = j: set 
j := j + 1. 

IV. i is a boundary node. Until Y\j] ^ and i < j: set j := j + 1 
Compare i and j. There are two cases: 
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1. i<ij: Compare si and j/. If si < j set R[ri] := R[ri]U r2, S[si] :— S[si]U S2, and 
(si,S2) := (j,LEFTco)(l,y[j])). 

Set (ri,r2) := (i, right c (i) (l, X[i])) and z = i + 1. 

2. Otherwise compute (r, s) := MOPc(i,j.u)(^[i], where v is the top boundary 
node in the chister i and j belong to. 

If r ^ do: 

— Compare si and j. If si < j or if si = j and LEFTOFc(ij)(^[«], S2) = then 
set R[n] := i?[ri] U rz, S'[si] := S'[si] U S2. 

- Set (ri,r2) := (i,r) and (si,S2) := (j, s). 
There are two subcases: 

(a) i = joTi = l{v,w) andj = s{v,w). Set X[i] := righTc(j)(1, RiGHTOFc(i)(^[«], ?')) 
and j := j + 1. 

(b) i = s{v, w) and j = r{v, w). If r = set j := j + 1 otherwise set i := j. 

Set R[ri] := R[ri] U and S[si] := S[si] U S2. 
Return {R, S). 

Procedure MopSim is somewhat similar to the previous implementation of the procedure MopRight from 
Section 14.21 As in the previous implementation we have a "potential pair" {{ri,r2), (si, S2)), where ri and 
si are macro nodes, r2 Q -^[''i], S2 Q Y[si], where r2 = {r^ < • • • <1 r*^} and S2 = {s^ <!•••< s*^} such that 
r' < for I — 1, ... k. Furthermore, for any I there exists no node y S Y[j], for j < si, such that r' <] y <] s' 
and no node x E X[i], for i < ri, such that r' <] a; <] s'. 

We have the following invariant at the beginning of each iteration: 

$x e X[i], such that x < x' , for any x' £ r2. (10) 

We first find the next non-empty macro node i. We then have 4 cases depending on which kind of node 
i is. In Case I i is a left node. Due to Proposition [6] we can have mop in i (case (i)), in the spine (case (iii)), 
or in a node to the right of i (case(v)). In Case II i is a spine node. Due to Proposition [6] we can have mop 
in the right node (case (iv)) or in a node to the right of i (case(v)). In Case III i is a right node or a leaf 
node. Due to Proposition [6] we can have mop in i (case (i) and (ii)) or in a node to the right of i (case(v)). 
In the last case (Case IV) i must be a boundary node and mop must be in a node to the right of i. 

We then compare i and j. The case where i <lj is similar to the previous implementation of the procedure. 
We compare j with our potential pair. If si < j then si <\ j since the input is deep, and we can insert r2 
and S2 into our output node arrays R and S, respectively. We also set si to j and S2 to the leftmost node in 
Y[j] (if si = j we already have (si, S2) = {j, LEFT(7(j)(l, F[j]))). Then — both if si < j or si = j — we set ri 
to i and r2 to the rightmost node in X[i]. That we only need the rightmost node in X[i] and the leftmost 
node in Y[j] follows from the definition of mop and the structure of the macro tree. 

Case 2 {i -/Ij j) is more complicated. In this case we first compute mop in the cluster i and j belong 
to. If this results in any minimum ordered pairs (r ^ 0) we must update our potential pair. Otherwise we 
leave the potential pair as it is and only update i and j. If r 7^ we compare si and j. As in Case 1 of 
the procedure we add our potential pair to the output and update the potential pair with r and s if si < j, 
since this implies si < j. If si — j and no nodes in X[i\ are to the left of the leftmost node in S2 we also 
add the potential pair to the output and update it. We show in the next section that in this case |s2| = 1. 
Therefore we can safely add the potential pair to the output. In all other cases the pair (r, s) ^ (0, 0) shows a 
contradiction to our potential pair and we update the potential pair without adding anything to the output. 

Finally, we update X[i], i, and j. There are two cases depending on i and j. In Case (a) either i = j 
or i is a left node and j is the corresponding spine node. In both cases we can have nodes in X[i] that 
are not to the left of any node in Y[j]. These nodes could be in a minimum ordered pair with nodes from 
another macro node. We show in the next section that this can only be true for the rightmost node in X[i\. 
X[i] is updated accordingly. After this update all nodes in Y[j] are to the left of all nodes in X[i] in the next 
iteration and therefore j is incremented. In Case (b) i is a spine node and j is the corresponding right node. 
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Since the input lists are deep, there is only one node in If r = then no node in Y[j] is to the right of 

the single node in X[i]. Since the input arrays are deep, no node later in the array X can be to the left of 
any node in Y[j] and we therefore increment j. li r ^ ^ then (ri,r2) = and we update i. Instead of 

incrementing i by one we set i := j, this is correct since all macro nodes with macro node number between 
i and j are descendants of i, and thus contains no nodes from X, since X is deep. 

When reaching the end of one of the arrays we add our potential pair to the output and return. 

As in Section [42l we can implement MopLeft similarly to MopRight. 



Procedure Match takes three node arrays X, Y, and Y' representing deep sets X, y, and y', where 
\X\ — \y\, and y C y. The output is a node array representing the set {Xj \ yj e 3^'}. 

Match(X, Y, Y') Initialize an empty node array R of size um, set Xl := 0, Yl := 0, Yl := 0, x := 0, y := 0, 
i := 1 and j := 1. 

Repeat until i > um or j > um- 

Until X[i] 7^ set i := i + 1. Set x := SiZE(X[i]). 
Until Y\j] ^ set j j + 1. Set y := SiZE(y[j]). 
Compare Y[j] and Y'[j]. There are two cases: 

1. Y[j] = Y'[j]. Compare x and y. There are three cases: 

(a) x = y. Set R[i] := R[i] U X[i], i := i + 1, and j := j + 1. 

(b) x<y. Set R\i] := R{i] U X\il Y[j] := RiGHT(y - x, Y[j]), Y'[j] := Y[j], and 
i := i + 1. 

(c) X > y. Set Xl := LEFT{y,X[i\), R[i\ := R[i\ U Xl, X[i\ := X[i\ \ Xl, and 
j ■■=J + l- 

2. Y[j] 7^ Y'[j]. Compare x and y. There are three cases: 

(a) x = y. Set R[i] := R[i] U MATCii{X[i],Y[j],Y'[j]), i:=i + l, and j := j + 1. 

(b) x<y. Set Yl := left{x,Y[j]), Yl := Y'[j] D Yl, 

R\i] := R\i]UMATCHiX[i],YL,Y[), Y[j] := Y[j]\YL, Y'[j] := Y'[j]\Yl, and 
i := i + 1. 

(c) X > y. Set Xl := LEFT(y, R[i] := R[i] U MATCii{XL,Y[j],Y'[j]), 
X[i] :^X[i]\Xl, andj :-j + l. 

Return i?. 

Procedure Match proceeds as follows. First we find the first non-empty entries in the two node arrays X[i] 
and Y[j]. We then compare Y[j] and Y'[j]. 

If they are equal we keep all nodes in X with the same rank as the nodes in Y[j]. We do this by splitting 
into three cases. If there are the same number of nodes X[i] and Y[j] we add all nodes in X[i] to the output 
and increment i and j. If there are more nodes in Y[j] than in X[i] we add all nodes in X[i] to the output 
and update Y[j] and Y'[j] to contain only the y — x leftmost nodes in Y[j]. We then increment i and iterate. 
If there are more nodes in X[i] than in Y[j] we add the first y nodes in X[i] to the output, increment j, and 
update X[i] to contain only the nodes we did not add to the output. 

If Y[j] Y'[j] we call the cluster procedure match. Again we split into three cases depending on the 
number of nodes in X[i] and Y[j]. If they have the same number of nodes we can just call match on X[i], 
Y[j], and Y'[j] and increment i and j. If SiZE(F[j]) > SiZE{X[i]) we call match with X[i] the leftmost 
SiZE(X[i]) nodes of Y[j] and with the part of Y'[j] that are a subset of these leftmost SiZE(X[i]) nodes of 
Y[j]. We then update Y[j] and Y'[j] to contain only the nodes we did not use in the call to match and 
increment i. If SiZE(y[j]) < SiZE(A'[i]) we cah match with the leftmost SiZE{Y[j]) nodes of X[i], Y[j], and 
Y'[j]. We then update X[i] to contain only the nodes we did not use in the call to match and increment j. 

FL{X,a): Initialize an empty node array R of size um and two node lists L and S. 
Repeat until i > nm: 
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Until X[i\ 7^ set i := i + 1. 

There are three cases depending on the type of i: 

1. i G {l{v,w),r{v,w)}. Compute 

UN ^(b, set R[j] = R[j] U (iV n V{C{j))) for each j e {i, s{v, w), v}. 
Otherwise, set L :— L o parent^^ (u). 

2. i = l{v). Compute 

N := FLc(^^v){X\i],a) . 

UN ^0, set R[j] R[j] U {N n V{C{j))) for each j e {i, v}. 
Otherwise, set L :— L o parent^(i'). 

3. i ^ {l(y,w),r(v,w),l{vy\. Compute 

N ■.= Fi.c{i){X\i],a) . 

UN ^(b, set R\i] := R\i] U N. 
Otherwise set L :~ L o parent^/ (i). 

Subsequently, compute the list S := flj\/(L,q;). For each node i € S* set := R[i] U 

FLc(j)(first(i),a)). 

Return Deep(_R). 

The Fl procedure is similar to Parent. The cases 1, 2 and 3 compute Fl on a micro forest. If the result 
is within the micro tree we add it to R and otherwise we store in a node list L the node in the macro tree 
which contains the parent of the root of the micro forest. Since we always call Deep on the output from 
Fl(X, a) there is no need to compute Fl in the macro tree if N is nonempty. We then compute Fl in the 
macro tree on the list L, store the results in a hst S, and use this to compute the final result. 

Consider the cases of procedure Fl. In Case 1 i is a left or right node. Due to Proposition [5] case (i) and 
(ii) fl of a node in i can be in i or on the spine or in the top boundary node. If this is not the case it can be 
found by a computation of Fl of the parent of the top boundary node of the i's cluster in the macro tree 
(Proposition [5] case (iii)). In Case 2 i is a leaf node. Then fl of a node in i must either be in i, in the top 
boundary node, or can be found by a computation of Fl of the parent of the top boundary node of the i's 
cluster in the macro tree. If i is a spine node or a boundary node, then fl of a node in i is either in i or can 
be found by a computation of Fl of the parent of i in the macro tree. 

5.4 Correctness of the Set Procedures 

The following lemmas show that the set procedures are correctly implemented. 
Lemma 18 Procedure Parent is correctly implemented. 

Proof. We will prove that in iteration i the procedure correctly computes the parents of all nodes in i. 
There are four cases depending on the type of i. Consider the case i G {l{v,w),r{v,'w)}, i.e., j is a left or 
right node. For all nodes x in C(«), parent(a;) is either in C(i), on the spine s{v, w), or is the top boundary 
node V. The parents of all input nodes in C(i) is thus in TV computed in Case 1 in the procedure. The 
last line in Case 1 ("For each j G {i, s{v, w),v}, . . .") adds the set of parents to the right macro node in the 
output array. 

If i is a leaf node l{v) then for any node x € C(i), parent(x) is either in C{i) or is the boundary node v. 
The parents of all input nodes in C(i) is thus in N computed in Case 2 in the procedure. The last line in 
Case 2 ("For each j G {i, v}, . . .") adds the set of parents to the right macro node in the output array. 
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If i is a spine node s{v,w) then the input contains at most one node in C(i), since the input to the 
procedure is deep. For any x G C{i), parent(a;) is either a node on the spine or the top boundary node v. 
This is handled by Case 3 in the procedure. Let x be the node in X[i]. If parent(a;) = v, then iV = 0, and 
we compute j, which is the parent u of i in the macro tree, and add j to the output array (since j — v is a 
boundary node first(j) = v). If parent(a::) is another node y on the spine, then N — {y} ^ and y is added 
to the output array. 

If z is a boundary node v, then parent('(;) is either another boundary node v' , the bottom node on a spine, 
or _L if is the root. This is handled by Case 3 in the procedure. In all three cases N = <i) and we compute 
the parent j of i in the macro tree. If i is the root, then j = _L and we do nothing. Otherwise, we add 
first(j) to the output. If parent{v) is a boundary node then first(j) = j. If j is a spine node then first(j) is 
the bottom node on j. 

In each iteration of the procedure we might add nodes to the output, but we never delete anything written 
to the output in earlier iterations. Procedure Parent thus correctly computes the parents of all nodes in 
X. □ 



Before proving the correctness of procedure NCA we will prove the following invariant on Xi and Yj in 
the procedure. 

Lemma 19 In procedure NCA we have the following invariant of Xi and Yj : 

left(1, Xi) = Xi and left(1, Yj) — yi for some I . 

Proof. The proof is by induction on the number of iterations of the outer loop. After the while loop on X 
in the first iteration i is the smallest integer such that X[i] ^ 0. Due to the macro tree order of the array 
X, X[i] contains the first nodes from X wrt. the preorder of the original tree (Proposition [8|) . Similarly, 
Y[j] contains the leftmost node in 3^. The invariant now follows immediately from the assignment of Xi and 

For the induction step consider iteration m and let i' and j' be the values of i and j after the while loops 
in the previous iteration. By the induction hypothesis LEFT(l,Xi') = Xi and left(1, Y,/) = yi for some I. 
Let n' = min(siZE(X[z']), SiZE(F[j'])). Then Xj/ contains Xi,. . . ,Xi+n' and Yj> contains We 
will show that left(1, Xi) = Xi+n'+i- In the end of the previous iteration we removed Xi' from X[i']. There 
are two cases. If X[i'] ^ then it clearly contains Xi+n'+i as its leftmost node. Since a spine node can only 
contain one node from X, i' cannot be a spine node. Thus i = i' when we get to the 5th line in the current 
iteration ("Set n :— min(siZE(X[i]), SiZE(y[j])), . . ."). It follows that left(1,Xj) — Xi+n'+i- In the other 
case X[i'] = after the removal of Xi'. It follows from the macro tree order of X that X[i] contains Xi^n'+i 
as its leftmost node. It follows by a similar argument that left(1, Yj) = yi+n'+i- Q 



Lemma 20 Let X and Y be two node arrays representing the deep sets X and y, \X\ ~ \y\ — k, and let Xi 
and yi denote the ith element of X and y, wrt. to their preorder number in the tree, respectively. For all 
i — 1, . . . , k, assume Xi <l yi. Procedure Nca(X,Y) correctly computes DEEP({nca(Ai, 3^i)|l <i< k}). 

Proof. We are now ready to show that the procedure correctly takes care of all possible cases from 
Proposition [7l The proof is split into two parts. First we will argue that some of the cases from the 
proposition cannot occur during an iteration of the outer loop of NCA. Afterwards we prove that the 
procedure takes care of all the cases that can occur. 

Case (iii) cannot happen since if z = j is a spine node then Xi is either a descendant or an ancestor of 
yi contradicting the assumption on the input that Xi <l 3^;. Case (vi) can only happen if i ^ j: If i ~ j and 
we are in case (vi) then i = j is a boundary node, and this would imply that C{i) only consists of one node, 
i.e., Xi — X[i] — Y[j] = 3^; contradicting the assumption on the input that Xi < 3^/. Due to this assumption 
on the input we also have that in case (iv) of the proposition i is either a left node or a spine node and j is a 
spine node or a right node. For case (v) either i is a left node and j is a descendant of the bottom boundary 
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node of i's cluster or j is a right node and i is a descendant of the bottom boundary node of j's cluster. All 
the other cases from case (v) would contradict the assumption that Xi < 3^; . 

The procedure first constructs two sets Xi and Yj containing the elements Xi, . . . , andj^; , ■ • • , yi+n 
for some respectively, where n = min(siZE(X[i]), SiZE(y [j])). The procedure NCA has two main cases 
depending on whether i = j or not. Case 1 (i = j) takes care of cases (i)-(ii) from Proposition [71 Case 2 
{i 7^ j) takes care of the remaining cases from Proposition [7] (iv)-(vi) that can occur. 

First consider Case 1: We compute nearest common ancestors N of the n nodes in Xi and Yj in a cluster 
S depending on what kind of node i is. We need to show that Case 1 handles Case (i) and (ii) from the 
Proposition correctly. 

• Case (i). i = j is a leaf node. By the Proposition the nearest common ancestors of the pairs in 
{Xi,yi), . . . , (A'f+ri, 3^i+n) from Xi and Yj is either in c{i) or in the boundary node, i.e., in C{i,v). 

• Case (ii). z = j is a left or right node. By the Proposition the nearest common ancestors of the pairs 
in {[Xi,yi), . . . , (A;_|_„, 3^i_(_„)} from Xi and Yj is either in c(i), on the spine, or in the top boundary 
node, i.e., in C{i,s(v,w),v). 

Thus S is correctly set in both cases. After the computation of N the output is then added to the entries in 
the output array R for each of the macro nodes in S. Case 1 thus handles Case (i)-(ii) (and only these two 
cases) from Proposition [T] 

Next consider Case 2 {i ^ j). We first compute the nearest common ancestor h oi i and j in the macro 
tree. The macro node h is either a boundary node or a spine node due to the structure of the macro tree 
(see also Proposition [7]). We will show that Case 2 takes care of the remaining cases. 

• Case (iv). From the above discussion it follows that we have one of the three following cases, i = l{v, w) 
and j = s(u, w), i = l{v, w) and j = r{v, w), or i = s(w, w) and j = r{v, w). All three cases are handled 
in Case 2(b)i of the procedure. It follows from the proposition that NCA is computed in the correct 
cluster. 

• Case (v). It follows from the discussion above that either i — l{v,w) and w ^ j, or j = r(v,w) 
and w < i. These two cases are handled by Case 2(b)ii and 2(b)iii of the procedure. It follows 
from the Proposition that NCA is computed in the correct cluster. We need to argue that we can 
restrict the computation of NCA to the pair (right ( 1, Xi), w) instead of computing NCA for all nodes 
in {Xi, . . . , Xi+n}- Consider the case where i — l{v, w) and w < j (Case 2(b)ii of the procedure). Since 
w ^ for all r = ^ + n, and Xi <l Xi^i <\ . . . <\ then nca(AV,3^r) ^ nca(A';+„, 3^;+„) for 
all r = I, . . . Z + n. Thus we do not need to compute nca(<%V, 3^r) for r ^ n + I, since the output of 
the procedure is DEEP({nca(A'i,3^i)|l < i < k}). A similarl argument shows that we can restrict the 
computation to (w, left(1, Y,)) in Case 2(b)iii. 

• Case (vi). It follows from the discussion above and the proposition that i ^ 3 and i and j are in 
different clusters, and we are not in any of the cases from (iv) and (v). Thus h must be a boundary 
node and all the pairs {(A';,3^i), . . . , {Xi+myi+i\)\ have the same nearest common ancestor, namely h. 
This is handled by Case 2(a). 

We have now argued that the procedure correctly takes care of all possible cases from Proposition [T] It 
remains to show that all pairs from {nca(A'i,3^i)|l < i < fc} are considered during the computation. It 
follows from the invariant that we only consider pairs from the input. In the last lines we remove the nodes 
from the input that we have computed the ncas of in this iteration. It follows from the proof of the invariant 
that no entry in the input arrays is left nonempty. Thus all pairs are taken care of. □ 

To prove that procedure Deep is correctly implemented we will use the following fact about preorder 
and postorder numbers in the macro tree. 

Proposition 9 Let i and j be nodes in the macro tree identified by their macro tree number such that i < j. 
For all X € C (i) , y C'(j) we have 



34 



1. pre(a;) < pre(y) unless i = liv^w) and j — s(v,w). 

2. post(y) > post(a;) unless i = s{v,w) and j = r{v,w). 

Proposition 10 Let xi, . . . ,x„ be nodes from the macro tree associated with their macro tree number such 
that xi < X2 < ■ ■ ■ < Xn- If Xi < Xj for some i and j then Xi <\ Xk for all Xk > Xj. 

Proof. From Xi < xj we have pre(a;i) < pre(a:j) and post(xi) < post(a:j). Since Xk > xj we have 
pre(a:j) < pTe{xk) unless Xk = s{v,w) and Xj = l{v,w). In that case, pre(xfe) + 1 = prc{xj) > pre(j:i). Since 
Xi <1 Xj we have Xi ^ Xj and thus pre(a;fc) > pre(a;i). 

It remains to show that post(a;i) < post(a;fe). Assume for the salce of contradiction that post(a;fe) < 
post(a;i) < post(a;j). This implies Xi -< Xk and Xj -< Xk contradicting Xi <l xj. □ 

We will first prove the following invariants on i and j in procedure Deep. 

Lemma 21 After the while loop in line 4 of procedure Deep ("While X[i] = set i := i + 1.") we have the 
following invariant on i and j : For all I such that j < I < i we have X[l] — 0. 

Proof. Let i' be the value of i in the iteration before the previous one. Then i is the smallest index greater 
than i' such that the corresponding entry in X is nonempty. This is true since i is always incremented 
by one in the end of an iteration, and at the beginning of the next iteration we start with the while loop 
incrementing i until we find a nonempty entry. Since j — i' , i is the first nonempty entry greater than j and 
the claim follows. □ 



Lemma 22 Right before each iteration of the main loop of procedure Deep ("Repeat until i > um") we 
have the following invariant on j : For all nodes x € X[j] and y € X[l], where 1 < I < j , we have x -/i y. 

Proof. Recall that x < y pre(x) < pre(?;) and post(?;) < post(a::). By Proposition [9] the only case 
where we can have pre(a::) < pre(y) is if Z = l{v,w) and j = s(v,w) for some v,w. Assume this is the case. 
If X[l] = the claim follows trivially. Otherwise, let i' and j' be the values of i and j in the previous 
iteration, respectively (since I < j and X[l] ^ there must be such an iteration). We have j = I + 1, 
i' = j = s(v,w) and j' = I ^ l(v,w). Thus in the previous iteration the procedure entered case 3, where 
X[i'] was set to X[i']r)DEEPQQ(y_yjyg(y_yjyy^{X[i'](JX[j']), and thus X[j] contains no nodes that are ancestors 
of nodes in = 

□ 



Lemma 23 Procedure Deep is correctly implemented. 

Proof. We will prove that x £ DEEP(Ar) iS x G X and there exists no y G X such that x < y. 

Assume x € Deep(A"). Consider the iteration when x is assigned to the output. There are three cases 
depending on which case we are in when x is added to the input, li j <\ i (Case 1 of the procedure) then 
X £ Deep s{X[j]) and it follows from the invariant on j (Lemma that x has no descendants in any nodes 
y G X[l], I < j . For j < I < i the claim follows directly from Lemma [U It remains to show that x has 
no descendants in X[l] ioi I > i. By Proposition [10] we have j <l I for all / > i and the claim follows from 
Proposition m 

If j ^ i (Case 2 of the procedure) then j is a spine node s(v,w) and i is the corresponding right 
node r{v,w), and we compute N :— DEEPc(r{v,w),s{v,w),v){^[i] U -^[j]). Since x G Deep(A') we have 
X G R[j] — X[j] n N. It follows from the invariant (Lemma and the computation of N that x has no 
descendants in X[l] for any I < j. For / > j it follows from the structure of the macro tree that for any I > i 
we have j <\ I. For j < I < i the claim follows directly from Lemma [2T] The claim follows from Proposition^ 
For j < I < i the claim follows directly from Lemma [21] 
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If i j (Case 3 of the procedure) then i is a spine node s{v,w) and j is the corresponding left node l{v,w), 
and we compute iV :— DEEPc{i{v,w),s{v,w),v)iX[i]UX[j]). Since x e Deep(X) we have x G R[j] — X[j]nN. It 
foUows from the computation of N that x has no descendants in UX[j]. Since l{v,w) has no descendants 
in the macro tree it foUows from Proposition [S] that x has no descendants in X[l] for any I j. 

If X is assigned to the output in the hne next to last ("Set R[j] := Deep s{X[j]), where S is set as in Case 
1") then it follows from the invariant on j fLemma and the computation of DEEP5(X[j]) that x has no 
descendants in X. 

For the other direction let cc S X be a node such that X n V{T{x)) — {x}. Let / be the index such 
that X e X[l]. All nonempty entries in X are i in the second line of the main loop ("Compare i and j..") 
at some iteration. Consider the iteration when i — I. Unless i = l{v,w) and j = s{v,w) (Case 3 of the 
procedure) X[i] is not changed in this iteration. If we are in Case 3, then TV is computed and X[i] is set 
to X[i] n A^. Since x has no descendants in X we have x G N and thus x G X[i] after the assignment. At 
the end of this iteration j is set to i. Consider the next iteration when j = I- li j <J i or i > um then 
X G DEEP5(X[j]) = R[j]. li j ^ i we have j — s{v,w) and i — r{v,w) since x has no descendants in X. For 
the same reason we have x £ N and thus x G X[j] H N = R[j]- If i j we have i = s{v, w) and j = l{v, w). 
Again x e N and thus x e X[j]n N = R[j]. □ 

We now consider procedures MopSim and Match. 

Lemma 24 Let ((^i, r2), (si, S2)) be as defined in procedure MopSim. Then ri and si are macro nodes, 
r2 ^ A'[ri], S2 C ^[si], where r2 = {r^ < • • • <1 r'^'} and S2 — {s^ <] • • • <l s''}. For any I — 1, . . . , k we have 

1. r' < s\ 

2. for all j < si there exists no node y G Y[j] such that r' <l y < s', 

3. for all i < ri there exists no node x G X[i] such that r' <] a; < s'. 

Proof. It follows immediately from the code that ri and si are macro nodes and that r2 C A[ri], S2 C ^[si], 
where r2 — {r^ <J ■ ■ ■ <i r'^^} and §2 — {s^ <] • • • < s*^^}. Due to the macro tree order of the tree and the fact 
that X represents a deep set, no node in X[i] can be to the right of any node in A[ri] for i < ri. To prove 
condition 3 it is thus enough to prove it for i — ri. We proceed by induction on the number k of iterations of 
the outer loop. We consider the time right after the fcth iteration of the loop, i.e., right before the (fc + l)th 
iteration. The base case (fc = 0) is trivially satisfied. 

For the induction step let and for i = 1, 2 be the values of and Si, respectively, after the (fc — l)th 
round. There are 3 cases: 

1. r'2 ~ r2 and = 82- 

2. r'2 7^ r2 and s'2 — 82- 

3. r'2 7^ r2 and s'2 7^ S2 

Let i* and f* be the values of i and f at line 11 ("Compare i and j") in iteration fc. 
Case 1: the claim follows directly from the induction hypothesis. 

Case 2: condition 2 from the lemma follows directly from the induction hypothesis. Since S2 and thus 
also s\ were not changed, r2 was set in case 1 of the procedure and j* = si. Therefore, i* <ij*, ri = i*, and 
\r2\ = 1. Let r2 = {r^} and S2 = {s^}- We have ri = i* <J j* = si and thus <l s^ satisfying condition 1 
from the lemma. To prove condition 3 is satisfied we only have to consider the case i — ri. Since r2 was set 
in case 1 of the procedure, r^ is the rightmost node in X[ri] and it follows immediately that there exists no 
node X G X[ri] such that r^ < x <\ s^. 

Case 3: Let y G Y[j], for < si, be a node such that y ^ S2- We will show that r' is not to the left of 
y. Assume j = si. Since s'2 ^ S2 then there are two cases depending on which case of the procedure the 
potential pair was set in. If it is in case 2 of the procedure the claim follows from the correctness of the 
implementation of mop and the computation (r, s) — mop^j-,; ^ {X[i*], Y[j*]) = mop^^,; ^ {X[i*], Y[si]). If 
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it is in case 1, then ri = i* <Jj* — si. Since S2 is the leftmost node in Y[si] (Hne 2 in case 1 of the procedure: 
"Set (ri,r2) := («, RiGHTc(i)(l, the claim follows. Now assume j < si. We will use that we just 

proved the claim for j = si. Assume for the sake of contradiction that there exists a, y G Y[j] such that 
r' < J/. Since Y is representing a deep set and due to the macro tree order of Y this implies x <\y <\y' for 
all y' e Y[si\ contradicting that the claim is true for j ~ si. □ 



Lemma 25 We have the following invariant at the beginning of each iteration of MopSiM; 

'^x G X[i], such that a; < x', for any a;' G r2. 

Proof. By induction on the number of iterations of the outer loop. In the base case r2 = and the condition 
is trivially satisfied. Note that X is representing a deep set and thus either x <\ x' or x' <\ x for all x G X[i]. 
For the induction step let i' , j' , and r2 be the values of i, j, and r2 respectively in the iteration before this. 
By the induction hypothesis x' <ix for al\x £ X [i'] and x' G r'2- Due to the macro tree order of X and the fact 
that X represents a deep set, all nodes in X[i'] are to the left of all nodes in X[i]. Thus, if r2 = it follows 
from the induction hypotheses that x' <ix for all x £ and x' E r2 — r2. For r2 there are two cases: 
If i' <\i' then r2 = RiGHTc(i/) (1, ^[«']) and i > i' and thus the condition is satisfied. Otherwise r2 was set in 
case 2 of the procedure. Since r2 ^ r'2 we have r2 = r C X[i'] and r 7^ 0. There are two subcases: If « = j or 
i = /(w, w) and j = s{v, w) (Case 2(a) of the procedure) then X[i] either contains a single node, which is the 
rightmost of the nodes in X[i'] that are to the right of all nodes in r2 or if there arc no such nodes X[i] = 0. In 
both cases the condition is satisfied. Ifi = s(v, w) and j — r{v, w) then i > i' and the condition is satisfied. □ 



Lemma 26 Procedure MopSim is correctly implemented. 

Proof. Let X and y be the sets represented by X and y, respectively. We want to show that 

(x, y) e MopSim(X, Y) {x, y) G mop(A', 3^) . 

Assume (x, y) G MopSim(X, Y). Consider the the round where x and y were added to R and 5, respectively. 
We have x — r' E r2 and y — s' E S2. We want to show that there is no node x' G X[i] for any i such that 
X <i x' <i y and no node y' G Y[j] for any j such that x <]y' <\y. By Lemma |24I this is true for i < ri and 
j < si. By the macro tree order of Y we have that y <\y' for any y' G Y[j] when j > si. Let i' be the value 
of i in the round where x and y is added to the output. We will show that no node in X[i'] is to the left 
of any node in S2- Due to the macro tree order of X this implies that no node in X[i\ is to the left of any 
node in S2 for any i > i'. If i' = ri then it follows directly from Lemma [24l If i' > ri it follows from the 
implementation of the procedure that i' is the first non-empty entry in X greater than ri. Thus the claim 
follows for any j. We now return to show that no node in X[i'] is to the left of any node in S2. There are 
two cases depending on whether j = Si or j > si. If j > Si then j was changed either in one of the four 
cases I-IV, or in the previous iteration in case 2. If j was equal to si at the beginning of this iteration then 
j was incremented in one of the four cases I-IV. Thus none of the cases applied to si. By Proposition [5] no 
node in X[i'] can be to the left of a node in X[si]. Since S2 ^ X[si] the claim follows. If j = si it follows 
from case 2 that LEFT(X[i'], S2) = (otherwise the potential pairs would not have been added to the output 
in this iteration) and the claim follows immediately. 

Now assume {x,y) G mop (A", y). We will deal with each of the cases from Proposition [6] separately. 

1. Case (i): c{x) = c{y) = r(v,w). 

2. Case (i): c{x) — c{y) — l{v,w). 

3. Case (ii): c{x) — c{y) = ^(i;). 

4. Case (iii): c{x) — l{v,w) and c{y) — s{v,w). 
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5. Case (iv): c{x) = s{v,'w) and c{y) = r{v,'w). 

6. Case (v): c{x) = l{v,w) and c{y) = r{v,'w). 

7. Case (v): c{x) < c{y) and c{x) and c{y) belong to different clusters. 

Note that if c{x) <\ c{y) then x is the rightmost node in X[c(x)] and y is the leftmost node in Y[c{y)\. We 
first show that in all cases we will have x = r' G r2 and y = s' e S2 for some I at some iteration. Consider 
the first iteration where either x G X[i\ or y G Y\j]. Let i' and j' be the values of i and j, respectively, in 
this iteration. There are three cases: 

(a) x G X[i'] and y G Y[j']. For case 1-5 the procedure goes into case 2. From the correctness of MOP we 
get a; G r and y G s. Thus r 7^ and wc set (ri, = (i', r) and (si, S2) = {]' ■ s) and the claim follows. 
For case 6-7 the procedure goes into case 1. Since this iteration is the first where y G Y\j\ we have 
f > si and we set (ri,r2) = (i', RiGHTc(»') (1, and (31,82) = {j' ,LEFTc(j'){l,Y[j'])). Since x is 
the rightmost node in X[i'] and y is the leftmost node in y[c(j')] the claim follows. 

(b) X G X[i'] and y ^ Y[j']- Since {x,y) G mop{X,y) this implies < c{y) and there exists no node 
y' G such that x < 1/'. Assume that there existed such a y' . Then x <\y' <\y due to the macro 
tree order of Y contradicting {x,y) G mop(A',3^). Thus i' ^ /. From case I-IV of the procedure it 
follows that either i' = j', i' = l{v, w) and j' = s{v, w), or i' = s{v, w) and j' = r{v, w). From this and 
j' < c{y) it follows that we are in case 4 or 7 from above. 

The procedure enters case 2 in this iteration. If we are in case 4 then i' = l{v, w) = j' and c{y) = s{v, w). 
If r = then i = i', X[i'] is unchanged, and j — j' + I ^ s{v, 'w) = c{y) at the end of this iteration. 
If r 7^ then x must be to the right of all nodes in x' G r. Assume that there is a x' G r such 
that X <i x'. Since x' & r there exists a node y' & s such that x' < y' < y. That y' <\ y follows from 
y' G l{v,w) and y G s{v,w) and the assumption that y is deep. Thus x < x' < y' < y contradicting 
that {x,y) G mop(A',3^). Therefore, i = i', x € X[i'] and j = j' + I = s{v,w) = c{y) at the end of 
this iteration. From case I of the procedure and the analysis of case (a) it follows that a; = r' G r2 and 
y = s'' G S2 for some I. 

Now assume we are in case 7. By the same argument as before i = i' , x G X[i], and j > f at the end 
of this iteration. Unless i' = l{v, w) = j' this implies that i <] j at line 11 ("Compare i and j") in the 
next iteration. If i' = l{v,w) = j' then either i <i j after the first loop in the next iteration, and the 
claim follows as before, or i = l{v,iu) and j = s{v,'w). In the last case we get into case (b) again, but 
it follows from the analysis that in the iteration after the next we will have i < j = c{y). The claim 
follows from the analysis of case (a). 

(c) X ^ X[i'] and y G Y[j']. It follows by inspection of the cases that unless we are in case 1 we have i' <\j' . 
If we are in case 1 (/ = c{x) = r{v, w) = c{y)) we have either i' < j' or i' = s{v, w). First we consider 
the cases 2 7. Since i' <i j' the procedure enters case 1 in this iteration. Thus i is incremented and j 
stays the same. This happens until i = c{x). Now consider case 1. If i' <1 j' the procedure enters case 
1 in this iteration. Thus i is incremented and j stays the same. In the next iteration either the same 
happens or i' = s{v,'w). If i' = s{v,w) the procedure enters case 2. Since i' is a spine node and X is 
deep, X[i\ contains only one node x' . By the structure of the macro tree and the assumption that X 
is deep x' <x. Since x <\ y G Y[j'] this implies r ^ 0. It follows from case 2(b) of the procedure that 
i is incremented while j stays the same. At line 11 ("Compare i and j") in the next iteration we will 
have j = i = r{v,'w) since all entries in X between / and r{v,w) are empty due to the assumption 
that X is deep. The claim follows from the analysis in case (a) . 

It remains to show that once x = r' G r2 and y = s' G S2 they will stay this way until added to the output. 
Consider the iteration where x and y are assigned to r2 and 82- At the end of this iteration either i or j 
or both are incremented. Assume j is incremented while the potential pairs are still unchanged. Since j is 
incremented we have Si < j until Si is changed. It follows from case 1 and 2 of the procedure that in this 
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case (ri, is only changed if at the same time (si, S2) are changed and right before that (ri, 7^2) and (si, S2) 
are added to the output. 

Consider first case 1-3. If i is incremented then j is incremented in one the cases I-V in the next iteration 
since i' = j' . By the above argument x and y are added to the output. For case 4 j is incremented (case 2(a) 
of the procedure) and the claim follows as before. For case 5-7 first note that r2 and S2 contain only one node 
each, i.e., x = r2 and y — 82- For case 5 i is incremented (case 2(b) of the procedure). Since X is deep we 
have i > r{v, w) = j' at line 11 ("Compare i and f) in the next iteration. If i > r{v, w) then j > j' and the 
claim follows. If i — r{v, w) the procedure enters case 2. If r = then j is incremented and the claim follows. 
If r ^ then si = j and {x,y) G mop{X,y) implies LEFTc(ij}{X[i], S2) = leftoFc(j .,■) (X[i], y) = 0. Thus 
(ri,r2) and (si,S2) are added to the output. If we are in case 6 and 7, i is incremented. Consider case 6. 
Since (x,y) £ mop(^,3^) all entries in X between l{v,w) and r{v,'w) are empty. Thus at line 11 ("Compare 
i and j") in the next iteration i > r(v,w). The proof is equivalent to the one for case 5. Consider case 7. 
If j is a boundary node then all entries in X between c(x) and j are empty. Thus j is incremented in the 
second loop of the next iteration. For all other cases for j the proof is similar to the proof of case 5. □ 



Lemma 27 In procedure Match we have the following invariant of X[i] and Y[j] after line 2 in the main 
loop ("Repeat until"): 

left(1, X[i]) = Xi and left(1, Y[j]) = yi for some I . 

Proof. Induction on the number of iterations of the outer loop. Base case: In the first iteration X[i] and 
Y[j] are the first nonempty entries in X and Y and thus LEFT(l,X[i]) ~ X\ and LEFT(l,y[j]) = y\. For 
the induction step let i' and j' be the values of i and j in the previous iteration. By the induction hypothesis 
left(1, X[z']) — Xy and left(1, y[j']) = y^ . \i x = \X\i'\\ ~ both i and j were incremented and 

LEFT(l,X[i]) = Xvj^x and LEFT(l,y[j]) = yv+x- If a; = \X\i'\\ < \Y[j']\ then i was incremented implying 
LEFT(l,X[i]) = Xi>+x- In that case j = j' and Y[j] = left{x, Y[j']) implying LEFT(l,y[j]) = yi'+x- Sim- 
ilarly, if \X[i']\ > \Y[j']\ = y we have left(1, X[i]) = Xi'+y. In that case j = j' and left(1, Y[j]) = yi'+y □ 



Lemma 28 Procedure Match is correctly implemented. 

Proof We need to show that for edl 1 < k < \X\: Xk € Match(X, Y, Y') ^ Xk & {Xj\yj e 3^}. Consider 
the iteration where Xk G X[i] and yk G Y[j]. By Lemma \T7\ such an iteration exists. If Y[j] — Y[j'] then 
yk G y implying Xk G {Xj\yj G y}. It follows from the implementation of case 1(a) and 1(b) that if x < y 
all nodes in X[i] are added to the output and thus Xk G Match(X, F, Y'). U x > y then Xk G LEFT(y, X[z]) 
since yk G Y[j] and thus Xk G Match (X, Y,Y'). 

If Y[j] ^ Y'[j] the procedure calls match with some subset of X[i], Y[j], and Y'[j] depending on the 
size of X and y. By Lemma [57] and the correctness of match it follows that Xk G Match (X, Y, Y') -i^ Xk G 

{x,\y,Gy}. □ 



Lemma 29 Procedure MopRight is correctly implemented. 

Proof. Follows from the correctness of MopSim (Lemma [26l) and Match (Lemma [28l) . □ 
Finally, we consider correctness of the Fl procedure. 

Lemma 30 Procedure Fl is correctly implemented. 

Proof. Let X denote the set represented by X and let F ~ {R(x, a)\x G X}. To show Fl{X, a) C F we will 
first show that for any node x added to R during the computation x G F. Consider a node x G R[i] for some 
i. Either x was added directly to R after a computation of TV in one of the three cases of the procedure or it 
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was added after the computation of S. In the first case x € F follows from the correctness of Flc . If x was 
added after the computation of S it follows from the correctness of Flm that x £ C{i) for some i E S. Due 
to the correctness of Fhc we have x G F. 

To show Deep(F) C FL{X,a) we use Proposition [51 Let a; be a node in Deep(F) and let x' be a node 
in X such that ii{x',a) = x. We have x' G X[i] for some i. If z is a left or right node then according to 
Proposition [5] a; can be in i (case (i)), on the spine (case (ii)), in the top boundary node (case (ii)), or in an 
ancestor of i in the macro tree (case (iii)). If x is in the same cluster as x' then it follows from the correctness 
of FLc that X € N. Thus x is added to R and due to the correctness of Deep we have x £ Fl(X, a). If c(x) is 
in a different cluster than c{x') then c{x) is an ancestor of c{x') in the macro tree due to Proposition [5l Since 
X € Deep(F) we have N = % and thus parent(w) <m c{x') is added to L. It follows from the correctness of 
Flm that c{x) £ S. Due to the structure of the macro tree c(x) is either a boundary node or a spine node 
and thus x — flc(c(x))(first(c(x)), a) = FLq {c{x)){^'<^st{c{x)) , a). The last equality follows from the correctness 
of FLc- That X G Fl(X, a) now follows from the above analysis showing that only nodes from F are added 
to R and the correctness of Deep. 

If j is a leaf node then x can be in i (case (i)), in the top boundary node (case (iii)), or in an ancestor 
of i in the macro tree (case (iii)). The correctness follows by an analysis similar to the one for the previous 
case. If i is a spine node or a boundary node, then x is either in i (case (i)) or in an ancestor of i in the 
macro tree (case (iii)). The correctness follows by an analysis similar to the one for the first case. □ 



5.5 Complexity of the Tree Inclusion Algorithm 

To analyze the complexity of the node array implementation we first bound the running time of the above 
implementation of the set procedures. All procedures scan the input from left-to-right while gradually 
producing the output. In addition to this procedure Fl needs a call to a node list implementation of Fl on 
the macro tree. Given the data structure described in Section 15.21 it is easy to check that each step in the 
scan can be performed in 0(1) time giving a total of 0{nT / logny) time. Since the number of nodes in the 
macro tree is 0(riT/ logn^), the call to the node list implementation of Fl is easily done within the same 
time. Hence, we have the following lemma. 

Lemma 31 For any tree T there is a data structure using O^nx) space and ©(n^logriT) preprocessing time 
which supports all of the set procedures in Oinx/ lognx) time. 

Next consider computing the deep occurrences of P in T using the procedure Emb of Section[3]and LemmalST] 
The following lemma bounds the space usage. 

Lemma 32 The total size of the saved embeddings at any time during the computation o/ EMB(root(P)) is 
Oinr). 

Proof. Let v be the node for which we are currently computing Emb. Let p be the path from the root to v 
and let wq, . . . ,wi be the light nodes on this path. We have / = Idepth(u). As in the proof of Lemma [T5l it 
suffices to bound |EMB(heavy(parent(wi)))| for all i. Assume that Ip < It (otherwise we can check this in 
linear time and conclude that P cannot be included in T). Each of the node arrays use ©(nT/logn^) space 
and therefore by Corollary [1] we have that J2i=i |EMB(heavy (parent (wi)))] — 0{n/ logur ■ log/p) — 0{nT). 
□ 

For the time complexity note that during the computation of EMB(root(P)) each node v € V{P) con- 
tributes a constant number of calls to the set procedures. Hence, the total time used by the algorithm is 
OiripUT I logriT + riT logn^). Thus we have shown the following. 

Theorem 4 For trees P and T the tree inclusion problem can he solved in O^npnx / logUT -\- nxlognT) 
time and 0{nT) space. 

Combining the results in Theorems [2l |4] and Corollary [2] we have the main result of Theorem [T] 
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6 Conclusion 



We have presented three algorithms for the tree inclusion problem, which match or improve the best known 
time complexities while using only linear space. We believe that some of the new ideas are likely to be 
of both practical and theoretical value in future work. From a practical perspective, space is a common 
bottleneck for processing large data sets and hence reducing the space can significantly improve performance 
in practice. From a theoretical perspective, we have introduced several non-trivial algorithms to manipulate 
sets of nodes in trees that may have applications to other problems. For instance, the NCA procedure from 
Section [5] computes multiple nearest common ancestor queries in time sublinear in the size of input sets. 
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